SOCIAL RESEARCH / Strategy and Tactics 



Strategy 
and Tactics 

BERNARD S. PHILLIPS 

Boston University 





THE MACMILLAN COMPANY, NEW YORK 
COLLIER-MACMILLAN LIMITED. LONDON 



© Copyright, Bernard S. Phillips, 1966
All rights reserved. No part of this book may
be reproduced or transmitted in any form or
by any means, electronic or mechanical, including
photocopying, recording or by any
information storage and retrieval system,
without permission in writing from the Publisher.


Fourth Printing, 1968

Library of Congress catalog card number 66-20823
THE MACMILLAN COMPANY, NEW YORK
COLLIER-MACMILLAN CANADA, LTD., TORONTO, ONTARIO


Printed in the United States of America



To: Marjorie and David 



Preface 


There are two fundamental conceptions which form the basis for the ideas
presented in this volume: that theory constitutes the most important research
tool available to the scientist, and that research methods may be fruitfully
conceived of as strategies and tactics adopted by a community of scientists.

It has long been acknowledged by behavioral scientists that theory and
method should be integrally related, but efforts in this direction have been
few. In this volume I try to contribute toward this end within the context of
a presentation of the standard topics in a basic course on research methods.
After a discussion of the elements of theory and a variety of research
strategies, theory is utilized to explore the topics of data collection, measurement,
the analysis of data, and the development of formal models. The approach
taken is the use of theory as a tool for obtaining an understanding of the
measurement situation itself as well as what goes on outside of it.

The conception of a community of scientists who are communicating with
one another in an effort to explain and predict phenomena serves to provide
a dynamic view of the research process. From this perspective the choices that
each investigator makes may be evaluated in terms of how effectively they
aid this community in reaching the goals of science. This principle will serve
to liberate the investigator from undue reverence for any particular method.
It will encourage him to make maximal use of his knowledge of the particular
research situation facing him and to develop and rely on his own ideas.

I have developed one of the research illustrations in considerable detail and 
used it to help provide continuity throughout the volume. It is held up not as 
a model of ideal research procedure, but rather as a realistic picture of a series 
of choice situations in which the researcher finds himself, uncertain that his 
decisions will prove to be correct. Regardless of this lack of certainty, how- 
ever, there exist various criteria for decision making which have served as 
useful guides for many researchers, and these are elaborated. 

I have attempted to combine in this volume a treatment of both strategy
and tactics. Broad orientations can be most effective when their detailed
implications for research decisions are spelled out. Thus, for example, the special
problems and techniques involved in experimentation, interviewing, observation,
the use of documents, and simulation must be taken up, as well as some
of the flavor and culture of methodology. Moreover, such information does
not constitute mere elaboration. It is within the context of such particulars that
the fundamental mechanisms of the research process are most clearly seen.

In Part One, “Theory and Method,” a number of ideas selected from the
philosophy of science are used to provide an introduction to the nature of
theory and the dynamics of its construction. Part Two, “Data Collection,”
covers a wide range of quantitative and qualitative procedures. The aim of
Part Three, “Measurement and Scaling,” is to combine a portrayal of the
axiomatic basis for measurement and scaling with descriptions of scaling
procedures. The emphasis in Part Four, “Analysis of Data,” is on integration of
some core ideas from statistics within a more general analytical framework.
Part Five, “Applications of Logic and Mathematics,” is designed to introduce
the reader to a field of growing importance in social science.

For their influence on the development of my ideas on the philosophy of
science as well as the sociology of science, I owe a great intellectual debt to
Sidney Rome, Wesley Salmon, and Marx [...]
experiences in social research were made more meaningful
[...] of my association with Kurt Back, [...]
and Robin Williams. I would also like to [...]


B. S. P. 



Contents 


PART ONE: THEORY AND METHOD

Chapter 1. Introduction to Research Methods
1.1 General Orientation
1.2 Introduction to a Research Illustration
1.3 Overview of the Study
1.4 Phase One
1.5 Phase Two
Selected References for Further Study

Chapter 2. Elements of Inquiry
2.1 The Language of Science
2.2 Concepts
2.3 Propositions
2.4 Theories
Selected References for Further Study

Chapter 3. The Process of Inquiry
3.1 Goals of Behavioral Science
3.2 Context of Discovery
3.3 The Human Equation
3.4 Research Strategy
Selected References for Further Study





PART TWO: DATA COLLECTION

Chapter 4. Principles of Data Collection
4.1 Definition of the Problem
4.2 Research
4.3 The Observer and the Observed
4.4 Degree of Structure
Selected References for Further Study

Chapter 5. The Experiment
5.1 Introduction
5.2 Mill's Experimental Methods
5.3 The Nature of the Experiment
5.4 Experimental Design
Selected References for Further Study

Chapter 6. Interviews, Questionnaires, and Surveys
6.1 Experiments and Surveys
6.2 Interviewer and Respondent
6.3 Survey Administration
6.4 Construction of Questionnaires and Interview Schedules
6.5 Projective Techniques and Other Indirect Methods
Selected References for Further Study

Chapter 7. Use of Documents
7.1 Introduction
7.2 The Historian's Approach to Documents
7.3 The Sociologist's Approach to Personal Documents
7.4 Statistical Documents
7.5 Prior Social Science Research
Selected References for Further Study




Chapter 8. Observation
8.1 Introduction
8.2 Observation and Values
8.3 Selection of Phenomena for Observation
8.4 Less Structured and More Structured Techniques
Selected References for Further Study

Chapter 9. Simulation
9.1 The Nature of Simulation
9.2 Simulation and the Experiment
9.3 Simulation and Closure
9.4 Simulation and Level of Abstraction
9.5 Computer Simulation
9.6 Simulation and Sequential Relationships
Selected References for Further Study





PART THREE: MEASUREMENT AND SCALING

Chapter 10. Principles of Measurement and Scaling
10.1 Strategy of Measurement
10.2 Measurement Validity
10.3 Reliability and Precision
10.4 Scales of Measurement
Selected References for Further Study

Chapter 11. Nominal and Ordinal Scales
11.1 The Discovery of Concepts
11.2 Nominal Scales
11.3 Ordinal Scales
Selected References for Further Study

Chapter 12. Interval and Ratio Scales
12.1 Interval Scales
12.2 Ratio Scales
12.3 Some Selected Measurement Procedures
Selected References for Further Study

PART FOUR: ANALYSIS OF DATA

Chapter 13. One and Two Variables
13.1 One Variable
13.2 Qualitative Analysis: Conceptualization of Variables
13.3 Two-Variable Analysis of “Reasons”
13.4 Two-Variable Analysis of Structured Data
Selected References for Further Study

Chapter 14. Principles of Statistical Decision Making
14.1 The Nature of Statistics
14.2 The Nature of Probability
14.3 Statistics and the Decisions of the Scientist
14.4 Relationship between Type I and Type II Errors
Selected References for Further Study

Chapter 15. Sampling, Statistical Testing, and Degree of Association
15.1 Probability Sampling
15.2 Tests of Significance
15.3 Measures of Association
Selected References for Further Study

Chapter 16. Multivariate Analysis
16.1 The Causal Process
16.2 Three Variables: An Illustration of a Spurious Relationship





16.3 Three Variables: An Illustration of an Intervening Variable
16.4 Three Variables: An Illustration of a Third Possibility
Selected References for Further Study

PART FIVE: APPLICATIONS OF LOGIC AND MATHEMATICS

Chapter 17. Some Applications of Logic and Mathematics
17.1 Language
17.2 The Logic of Choice
17.3 A Mathematical Formulation of a Theory of Choice Behavior
Selected References for Further Study

Index of Names

Index of Subjects






Theory and Method 


A basic understanding of the nature and functions of theory is essential for
the student of research methods. Theory must be explored both as a product
and as an element in the scientific process.

A detailed illustration of a sociological research project will be presented
in Chapter 1 with a view toward illustrating the strategy and tactics of
scientific investigation. No claim is made that the choices adopted in this study
were the best ones possible. However, the presentation seems to provide
examples of many of the types of methods which will be discussed in subsequent
chapters. Because of this and because of the relatively thorough discussion
of various aspects of the project, references to this study will be made
throughout the volume.

Chapter 2 presents the elements of scientific inquiry. The language of
science largely consists of concepts, propositions, and theories. The concept is
the elementary common unit of both propositions and theories. The utility
of propositions and theories depends in great part on the way in which the
concept is formulated. Propositions and hypotheses state relationships among
concepts, and it is through verification of propositions and hypotheses that
we infer cause-and-effect relationships. Theories provide systematic
statements of relationships among propositions, and they may be used for
purposes of explanation and prediction.

In Chapter 3 inquiry is presented as a dynamic activity of a community of 
scientists which is centered on the achievement of scientific goals. We may 
conceive of the scientist as being involved in a contest against nature in which 
he is called upon to select the most effective strategies and tactics possible. 
Some of the problems involved in such choices, as well as some of the means 
which have been developed for making these choices, are presented. 



Introduction to Research Methods 


1.1 GENERAL ORIENTATION
Some Functions of Social Research 

Social research is becoming an increasingly important activity in American
society as well as throughout the world. In the universities social [...]
attained a degree of respectability not far removed from that [...] by
physical science research. There has been a very rapid [...] research
activities in business and government, [...] agencies, since World War II.
Increasing numbers of individuals are engaged in social research, and it is
no longer true [...] activity performed solely by a small group of [...].

These developments have had, and will continue to have, far-reaching
consequences for the lives of all of us. The fundamental [...] of social science is
to achieve accurate explanations and predictions of [...]. [...] many
breakthroughs remain to be made before social scientists approach the
achievements of physical scientists, few would deny that a [...] progress has
already been made in the scientific study of [...] knowledge of human
behavior as has been [...] for a wide variety of social technologies. It would
be [...] impossible to locate any major sector of life which has [...] by these
technologies. In the field of religion, for [...] made on the basis of social
research [...] which affect [...] patterns of church parishioners, the [...]
degree of religiosity, and the effect of religiosity on ethical behavior.

As [...] and more decisions having to do with human [...] will be based on
the findings of social science [...] is vast and seems to be increasing as [...]
complex. In addition to the problem of averting nuclear war [...] other
difficult problems, such as those relating to crime, mental illness, [...].
Although social science knowledge has been [...] other problem areas, our
understanding of human behavior [...] to be [...] considerably if we are to
cope with these problems effectively.

It would seem, then, that the importance of social [...] be overestimated.
For the person with intellectual [...] offers hope of fulfilling man’s ancient
quest for [...] caught up in the misery of urban slums, social research can
[...] the better life. For mankind as a whole, social research may help [...]
means of avoiding nuclear destruction. There is, of course, no guarantee that
social research can lead to these results, but there is also no evidence that such
achievements are beyond its scope.


Some Beliefs About Social Research

In view of the importance of social research for extending the boundaries
of our knowledge and for the development of effective social technologies,
grave responsibility rests on the shoulders of students of human behavior. It is
their achievements or lack of them that will determine the degree to which
the promise of social science will be fulfilled. The serious student of
research methods might well commence by examining some of the beliefs
about the field which pervade the popular culture and which, consequently,
may form part of his own thinking. This subject will be taken up in greater
detail in Section 3.3.

There is a good deal of feeling that social research is not at present as
scientific as physical science research, and that genuine social sciences will
never emerge. There seem to be a variety of sources for such feelings. For
example, human beings are viewed as very special entities, set apart from the
rest of the physical universe, according to many different philosophical and
religious systems. If human behavior were to be predicted by social science
laws in the same way that the movements of physical objects can be predicted,
then the human being might be seen as losing some of his distinctiveness.

Most social scientists, on the other hand, believe that the behavior of human
beings is the lawful product of certain causes in the same way that the behavior
of physical objects follows from various causes. They feel that their methods
of unearthing these causes are as scientific as those of any other scientist,
although they admit to not having achieved the same degree of understanding
of the phenomena they investigate as the physical scientists. However, they
generally see no bar to reaching a far more effective and precise knowledge
of human behavior.

Within the popular culture much of the discouragement over the future of
social science stems from a focus on the present inadequacies of many social
technologies and technologists. There is the classic definition of the
sociologist as “someone who spends $100,000 to locate a house of prostitution” and
the numerous gibes at the “insane” or socially inadequate psychoanalyst. The
limited effectiveness of marriage counselors as social technologists, for
example, appears in marked contrast to the achievements of electrical engineers as
physical technologists.

A serious reply to such criticisms might refer, first of all, to the
achievements which social scientists have produced in a wide variety of fields.
Their present level of achievement should be viewed with the understanding
that these fields have been taken seriously as sciences only recently, when
compared to the physical sciences. Finally, there is the rather obvious point
that the limitations in the present achievements of social scientists do not imply
that there is little hope for future progress in this field. The relationships
among any set of phenomena appear to be highly complicated when we have
not discovered the fundamental laws involved, but this complexity may easily
disappear once such laws are discovered. Prior to their discovery, however,
it is easy to be pessimistic about ever being able to explain the phenomena.

Not all of the beliefs surrounding social research have their origins [...]
culture. In fact, some of the erroneous ones are [...] researchers themselves.
One of these is that one given method is ipso facto better than other methods,
regardless of the problem [...] investigation. Some researchers in the social
sciences have a [...] whether it be the experiment, the survey, or detailed
[...] informants. Their conviction that a given method is superior to [...]
most problems rarely seems to be based on rational grounds, i.e., a comparison
of the relative effectiveness of a number of methods [...] problem. Such an
approach may lead to the use of methods inappropriate to a given problem.

1.2 INTRODUCTION TO A RESEARCH ILLUSTRATION

Just as the scientist finds it useful to employ various levels [...]
in the development and testing of his theories, the [...] to alternate
abstract statements and concrete examples. [...] particular importance in
discussions of social research, [...] abstract and therefore challenging
subject matter. [...] of this chapter provides a detailed illustration of a
[...] and to introduce the reader to a wide range of methods in [...] provide
a frame of reference for points discussed in subsequent [...]. The particular
study chosen for presentation [...] occupational choice. More specifically,
it focuses on the [...] whereby medical students choose among the various
medical specialties. [...] selected because of the author’s own research,
involving a wide variety of methods in this area. Two phases of the [...]
discussed; each will include definition of the research problem, data
collection, [...] and scaling, and analysis of data. The [...] also include
consideration of mathematical [...].

Through this illustration, it is hoped that the reader will be alerted to some
of the many choices open to the researcher, as well as [...] of the decisions
which he makes. Whether the decisions illustrated here are wise ones is a
question which can [...] answered, since it depends in great measure on the
influence which the [...] on future research. Much of the remainder of this
volume will be [...] elaboration of the different strategies of research open
[...] investigator and the rationale for each. The reader should expect to see
[...] the procedures used in the illustration which remain unanswered [...]
chapter, but which will, we hope, be answered in ensuing chapters.


1.3 OVERVIEW OF THE STUDY 
For some years prior to the [...] in the fall of 1956, members of the
American Public Health Association [...] officials in the U.S. Public Health
Service had been concerned with recruiting medical students to a career in
public health. It had been difficult [...] the more qualified medical students
to available openings in public [...]. Although many explanations could be
offered, none were supported by comprehensive and accurate facts about medical
students in the United States.

It was felt by individuals in the American Public Health Association who
[...] with problems of recruitment that solutions to these problems rested
[...] answer to the question: “What makes physicians choose or reject public
health as a career?”

The decision was made to find the answer to this question by means of a
thorough research study by social scientists. A research proposal was drafted,
submitted to the National Institutes of Health, and subsequently approved.
The study was to be national in scope, and the particular location selected for
the project’s headquarters was the School of Public Health of the University
of North Carolina at Chapel Hill.

The professional staff who assembled for the initiation of the project in
September, 1956, consisted of a career public health physician who served as
project director, a sociologist, a psychologist, and a statistician. During the
ensuing five years a number of different studies were made, most of which had
to do with either medical students or public health physicians. The present
illustration is concerned with the data collected during the first two years of the
study.


1 The definition of the research problem, which constitutes the initial step within each
phase of the project, is the subject of Chapters 2 and 3. Data collection, measurement
and scaling, and analysis of data are treated in Parts Two, Three, and Four, respectively.
The discussion of mathematics and theory within the context of the project is taken up
in Part Five.




Much research is best understood when viewed as a continuing process in
which the knowledge gained from the earlier phases of the investigation is
fed back to the researchers and is utilized in their subsequent work. This was
certainly true for the medical specialties study. The information obtained from
a number of small-scale studies completed during the first year of operations
was utilized to develop the research instruments employed in the
survey of medical students during the second year. We might in fact designate
these two years as respectively constituting the first and second phases of the
project. In a sense each phase stands alone as a research project, and in the
remainder of this chapter the two phases will be [...]
particular attention will be paid to the ways in which data collected during
the first phase shaped developments during the second phase.

Although the project started out purely as a service [...] aid in the
recruitment of qualified medical students to public [...] set for the study
were soon broadened. The social [...] project were, as a result of their own
professional [...] exploring the possibilities for making a general [...].
For example, information about the choice of public health by [...] students
was seen within the context of the general [...] choice. Research efforts
were consequently expanded [...] the process whereby medical students choose
other medical fields, e.g., [...].

[...] relevant literature was [...] made with other individuals and
organizations engaged [...] number of individuals who were deemed [...] about
the specialty choices of medical students [...] decided to focus attention on
a national survey of medical [...] successive drafts of different parts of a
questionnaire were [...] tested on medical students. One type of question
[...] the scores of public health physicians with [...] students who did not
intend to go into public health. A sample of [...] of Medical Specialists was
analyzed to [...] in public health with those of other specialists. [...]
questionnaire was constructed on the basis of results of [...] a sample of
medical schools from schools throughout the [...] selected, and arrangements
were made for field work with those schools chosen. The first phase of the
study ended when results of these initial analyses were written up for
publication.2

The beginning of the second [...] the completion of field work in the eight
schools selected. The [...] handling of the data collected from the 2674
medical students involved proved to be a sizable task, but it was completed in
due course with [...] aid [...]. [...] analysis, interpretation, and writeup
were focused [...] medical students come to rate their interest in a [...]
“low.”3 The two factors utilized in this analysis were (1) the occupational
goals of medical students, and (2) expectations as to the opportunities for
achieving them in each of a number of medical fields.

2 Kurt W. Back, et al., “Public Health as a [...] Within a Profession,”
[...] Sociological Review [...] (1958).


1.4 PHASE ONE 

Definition of the Research Problem

The decision to investigate the process of choosing medical [...] rather
than that of public health grew out of the interest of the social [...] in
the general process of occupational choice. This [...] based on [...]
of certain similarities in the sequence of events leading to the [...]
medical specialty. Because relatively few medical students are interested in
public health as a career, definition of the problem in terms of public [...]
alone would have made it extremely difficult to locate a sufficient number of
individuals on which to base a thorough and comprehensive study. [...]
broader definition allowed the investigators to bring a great deal of [...] to
bear on the problems and made relevant the results of earlier studies [...]
process of occupational choice. In addition, any progress made would [...]
represent a contribution to general knowledge of the dynamics of the choice
process.

The task for the project staff came to be defined as one of obtaining
sufficient knowledge of the choice process so that intelligent decisions might
subsequently be made for dealing with the recruitment problem. This illustrates
the role of the behavioral scientist in many different types of applied setting:
rather than being asked to come up with specific recommendations or
decisions, he is often asked simply to collect the information on which those
decisions are to be based.


Data Collection 

At the beginning of the study there was a considerable amount of groping
for effective methods, concepts, and theory. Among the procedures tested was
a series of interviews with teachers in medical school. This was followed by a
series of questionnaires administered to small numbers of medical students.
For example, some students were asked to write an “autobiography — from 
now to the future,” the purpose of which was to obtain information on their
aspirations and future plans. Included were such questions as: “How did you
decide to study medicine?” “Do you usually plan, or do you act on the spur
of the moment?”

3 Bernard S. Phillips, “Expected Value Deprivation and Occupational Preference,”
Sociometry, XXVII: 151-160 (1964).

Some students were presented with a series of ambiguous drawings and
asked to imagine each one as some kind of situation and to describe how the
situation came about, what was happening, and how it would probably turn
out. This is a variant of the T.A.T., or Thematic Apperception Test, a test
commonly used by clinical psychologists.

One of the standardized psychological inventories used was the Edwards
Personal Preference Schedule, in which the student is asked to choose one
statement from pairs of statements such as:

A. I like to help my friends when they are in trouble.

B. I like to do my best in whatever I undertake.

This schedule of 225 paired statements provides measures of such personality
characteristics as achievement, deference, abasement, autonomy, dominance,
aggression, and nurturance.

A different kind of research [...] the abbreviated career descriptions found
in the Directory of Medical Specialists. [...] backgrounds of specialists in
public health were compared with those of specialists in internal medicine,
surgery, pathology, and psychiatry. The purpose here was to unearth any
differences existing between the career histories of public health physicians
and physicians in other specialties.

One study involved a total of 253 students from the four [...] medical
school. All the 300-odd medical students [...] requested (but not required)
to attend an assembly at [...] out a questionnaire. One part of the
questionnaire had to do with the social structure within the medical school.
For example:

A. Among the other medical students:

[...]

B. Among the faculty:

Who is the best teacher?

To whom have you talked most about the future?

Measurement and Scaling 

Another part of the [...] “With respect to the following fields of medicine,
indicate in Column A what you would like most about working in that field and
in Column B what you would like least.” The eight medical fields listed were
dermatology, general practice, internal medicine, pathology, psychiatry,
public health, surgery, and teaching. This question was designed to determine
what [...] important to medical students in their thinking about medical
[...] designate it as the open-ended values question.



The following constitutes the question as well as a series of fairly typical
responses:

With respect to the following fields of medicine, indicate in Column A what
you would like most about working in that field and in Column B what you
would like least.

Field               A: Would Like Most                B: Would Like Least

Dermatology         High income                       It's difficult to cure patients
General practice    Being a family physician          Very hard work
Internal medicine   [...] cases                       Having the type of patient who
                                                      is not very cooperative
Pathology           Doing research and the            You don't deal with patients
                    scientific status of the field    directly
Psychiatry          The chance to learn about         Involves too much emotional
                    people                            strain
Public health       The working hours                 Not being able to practice
                                                      medicine
Surgery             I'd be able to use my hands       Mistakes could be fatal
Teaching            Research and learning             Not having patients
                    experiences




[...] heading of measurement and scaling rather than under data analysis. The
former topic is actually a part of the latter one. The distinction is made,
however, on the basis of the degree to which the raw data collected must be
further manipulated prior to the initiation of the basic cause-and-effect
analyses.


The purpose of analyzing the approximately 4000 responses (sixteen
categories of responses for each of the 253 medical students involved) was to
learn about the fundamental occupational goals of medical students. An
open-ended question such as this allows the respondents latitude for indicating what
is most important to them. The key problem was to reduce the variety of
answers to a small number of categories which could provide a focus for later
stages of the project. These categories would then constitute the basis for
measuring occupational goals, with each category referring to a different goal.
After reading a good many of the answers, investigators established a number
of categories which seemed to distill most of the responses. Thus, for example,
“closeness of relations with patients” summarizes the responses “being a family
physician,” “you don’t deal with patients directly,” and “not having patients.” When
these categories had been compared with the data, and revisions and
additions had been made, the result was a set of goals which were then utilized
in subsequent phases of the project.
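
The coding step described above, in which many free-form answers are reduced to a few goal categories, can be sketched in a few lines of Python. This is only an illustrative sketch and not the procedure the project staff actually used: the function names and the "uncoded" bucket are assumptions made here, and only the first category and its three responses come from the text. The second category is hypothetical. The response count assumes that the sixteen categories of responses are the eight medical fields times the two columns of the question.

```python
from collections import Counter

# Codebook: goal category -> the answers it summarizes.
# Only the first category and its three answers are taken from the text;
# "income (hypothetical)" is an assumed category added for illustration.
CODEBOOK = {
    "closeness of relations with patients": {
        "being a family physician",
        "you don't deal with patients directly",
        "not having patients",
    },
    "income (hypothetical)": {
        "high income",
    },
}


def code_response(response):
    """Return the goal category for one answer, or 'uncoded' if none fits."""
    normalized = response.strip().lower()
    for category, answers in CODEBOOK.items():
        if normalized in answers:
            return category
    return "uncoded"


def tally(responses):
    """Count how many answers fall into each goal category."""
    return Counter(code_response(r) for r in responses)


# Size of the coding task: 253 students, each with 8 fields x 2 columns.
STUDENTS = 253
CELLS_PER_STUDENT = 8 * 2
TOTAL_RESPONSES = STUDENTS * CELLS_PER_STUDENT  # 4048, i.e. roughly 4000
```

Answers left in the "uncoded" bucket correspond to the stage at which the categories were compared with the data and then revised or extended.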





Analysis of Data

Much of the early work, e.g., the open-ended values question, proved to
be suggestive in pointing the way to the development of more calculatedly
structured instruments. Some of the methods tested, of course, were found
to be inadequate. For example, the series of ambiguous drawings produced no
clear-cut differentiation of responses between public health [...] students
not interested in that field. The Edwards Personal Preference Schedule,
on the other hand, revealed that students not interested [...]
higher scores for achievement and aggression needs [...]
students had higher scores for deference, abasement, and [...].
(It was later decided, however, to exclude this instrument from the already
long questionnaire because of the many questions involved.)

■fhe analysis of data from the C/recory of Medicnl “ “ 

be revealing with respect to career patterns, and con- 

Table I-l. Qualification as a specialist in a diflcrenl field, wh P 


TABLE 1-1

EXPERIENCE IN OUTSIDE
PUBLIC HEALTH, INTERNAL
PATHOLOGY, AND PSYCHIATRY

Columns: Public Health, Internal Medicine, Surgery, Pathology, Psychiatry

Percentage qualified in a different specialty        7.0
Percentage not qualified in a different specialty   93.0
Total                                              100.0

; (soo> 


100O 

025) 


96.0 

100.0 


1000 

(135) 


100.0 

(103) 


siderablc experience in a second specialty and 'Jll'iJc'wUh dala on the 

often among specialists in public health. ^ confirmation for the 

ages of individuals in the different ficld^ ve'^rs after craduation 

idea that phj-sicians typically enter jniblic health ^ medical fields, 
from medical school and after work in one or freuucnt efforts of the 

The numerical data found in T^ble 1-1 ^y the 
social scientist to quantify the information he bv non-numerical 

not imply any depreciation of the important ro e p 




INTRODUCTION TO RESEARCH METHODS 
2 Aca.,,. .= .a»r .»aan, a nacesaa. 

school revealed aome evidence of mala and conclnaiona on 

,„o.he,hprefereneealor.heva,.oos.n^rf fieUa^ 

this aspect of the analysis are presented in Section
“Social Relationships.”


1.5 PHASE TWO


Redefinition of the Research Problem


The research was more closely defin^ in the “^ic^ students 

Once the decision was made to study the ^ . ,1 aooroach to be 



choose a given field, it remained to specify the 


choose a given tieia, it rcmamcu lu r including per* 

taken. During the first phase the approach was , he exLctatioM and 


sonality characteristics, peer sequence 


goals or values of the medical students, the concept of ^ ® 

of occupational choices, the Idea of degree of commitment to a cMic , 



notion of a “deviant" medical specialty, and the concept of p 



The second phase focused on optimum or rational choice, as well as on
expectations and goals or values of the medical students. Although it was
recognized that other factors were involved, the object was to see how far a
more delimited framework could take the investigator.

This redefinition of the research problem enabled the investigator to formulate
particular hypotheses, i.e., tentative statements about relationships
among phenomena. These hypotheses related the specialty choice of medical
students, on the one hand, with certain of their values and expectations on the
other. They emerged from the first phase of the study, in which a wide umbrella
was provided for most of the important ideas about factors which affect
the medical student's choice of specialty.

For example, many of the personality characteristics of medical students
which seem to be involved in the choice process can be subsumed under their
goals or values, e.g., degree of interest in close relationships with patients,
desire for a high income, interest in helping people. Much of peer influence
and faculty influence can be conceived of as expectations which these individuals
send out to a given medical student. Such expectations are then received
by the student and play a large part in shaping the development of his
own expectations about, e.g., what the different specialties have to offer. Two
basic hypotheses were involved:


1. The goals of medical students affect or are related to their choice of
specialty.

2. The expectations medical students have about the opportunities for achieving
desired goals (e.g., helping people, high income) affect or are related to their
choice of specialty.

Neither of these hypotheses is a particularly startling one, but it should be
borne in mind that the stuff of science often appears trivial. The scientist generally
strives to discover the correct formulation, whether or not it is a new one.

The redefinition of the problem went one step further than the statement
of these two basic hypotheses. One important way in which research proceeds
is to combine separate hypotheses about a given phenomenon so as to
achieve a more comprehensive explanation and more accurate predictions.
When this is done, we say that the researcher's formulation becomes more
theoretical. Although the term “theory” has connotations 
practicality, and uncertainty within the popular culture, i „ 

notations io, the scientist. To him it generally mean, a 

or set of statements about phenomena, a formulation w ic --ovide a 

one which can be tested directly or indirectly, an one w 
great deal of additional knowledge. Such knowledge inay 
for effective social technologies, and consequcnlly may be extremely p 
In the applied sense. , it is 

The combined formulation of the iwo ^1'^''’'“' " ' ^if io the 
theorized that medical students are interested in ‘f, ‘Xmtod. 

degree that their expected depnvaUon lor '"“™p refationships with 
For example, one student may be very interested expecta* 

patients and another may not. Both will mwt ^ oDoortunity for close 
tiom about the field of pathology, i.e., as offering i relation- 

relationships. In such an event, the student who 

ships would be seen as suffering expected ‘ ,s ^ values than 

ogy, but not the other student. Of or, more 

close patient relationships are involved, and ^ are 

specifically, expected value deprivation, has to do with all of those wmc 

"'m or any other formulation of the choice P'“'“ b”5eSs 

pletcly incorrect. Hypotheses and ihconw are nh^nomenon. Rather, 

as providing the last word for the explanation of a give p greater 

the, are viewed as one step along a road “'"/"tore seems 

undcistanding and more accurale P"^''”°”*' subscqwnlly cither thrown 
to indicate that most hypotheses and „ntributc’^nothing to the 

out or reformulated. This does not mean that > , -.rtain paths of 

progress of science. Rather, they serve ^ comprehensive and 

inquiry or they constitute the building blocks 

accurate formulations. ^renirular research niuslration. 

This process can be seen in smafl within ih p research in phase 

The early studies during phase one provided 




two, with some paths of inquiry being abandoned and others reformulated.
This revised focus, the relationship between expected value deprivation
and interest in a given specialty, was then tested in phase two.

Results indicated that a further reformulation of the theory might prove
more fruitful, and this is described in Chapter 17. This new formulation combines
a concern with expected value deprivation, on the one hand, with a
concern with expected value fulfillment on the other. Man is seen as making
choices so as to seek gratifications as well as to escape deprivations. The
research process will then continue with the testing and revision of this
formulation, etc.

The concern with value deprivation and not value fulfillment during phase
two was not the product of a conscious choice between the two. It grew out
of a research context in which there was a great deal of interest in why almost
all medical students did not choose public health as their specialty. Thus,
the immediate problem at hand was to explain why students were avoiding
public health, and the deprivation formulation seemed quite appropriate. One
may speculate about whether a project oriented around a more popular field
might have produced a gratification rather than a deprivation formulation.


Data Collection 

Selection of appropriate instruments for data collection was an early and 
fundamental step in the research project, carrying heavy implications for the 
entire study. Early work on the study indicated that a number of different ways 
of looking at the process of occupational choice seemed potentially fruitful. 

It appeared to the investigators that a survey might be the best instrument for
combining various approaches to the problem. One point in its favor was that
the technology of probability sampling can be used quite readily within a
survey research design. This meant that it would be possible to generalize the
results of the study of a properly selected sample of medical students over all
medical students in American society, with a limited and known degree of
error. In other words, probability samples are representative of larger populations.
This generalizability was important to the American Public Health
Association, where the project was initiated, because recruitment of medical 
students to public health was conceived of as a national problem. Encourage- 
ment to use questionnaires was found in the fact that some of the most inter- 
esting results obtained from the first year's studies had come from question- 
naire data. Self-administered questionnaires were favored over interviews 
because of the vast difference in costs, as well as because the investigators had 
already found that a complex questionnaire instrument could be handled by 
medical students with little difficulty. 

Closely related to selection of the questionnaire survey as the primary 





instrument of data collection in the second phase of the project was the
decision as to the most appropriate procedures for sampling. Because
generalizability of the results of the study to medical students throughout
the continental United States was deemed important, some type of
probability sample was essential. But because any project is limited in
financial and human resources, only a limited number of medical schools
could be involved. It was therefore decided to focus the entire resources of
the project on a very few medical schools, and in this way achieve a study in
depth. The outcome was the selection of eight medical schools un 
pr^edure that also assured geographic coverage of . 

considerable variation among the schools in the fraction of graduates ente mg 
general practice. (This last safeguard seemed to be an objective ^ 

hig that schools of different quality and different patterns of specialty choice 

'"For“cdlection of the data it was important to obtain the 

the eight medical schools selected. An immediate loss o ev 

the sample would make far more tenuous any j secure 

medical students throughout the country. Efforts were ere through 

the support of a number of highly prestigeful medica 0^6 reoresentatives. 

the establishment of an advisory committee made up o -.JUai schools 

The director of the project (a physician) then v.s.ted 

selected, and arranged for special two- to-three-hour . » jmdents 

by the medical students in each. (Some schools forma ^ ^ 

to attend the session; others put attendance on ^ gnj also 

fessional stall of the project subsequently supervised ^ 

arranged for local interviewers to administer the ques 1 ouestionnaire 

possible of those students who did not attend. The result - / 

completion rate of 91 percent for all schools. “/“"S' 

schools of 80 percent to 100 percent and a total of 2674 q 

^ Although the questionnaire included many 

those utilized in the second phase had to do wit goa , definition 

preferences or choices. This selective emphasis from^ "^"zation 

of the problem, which conceived preferenre or^ jp 

as a function of expected value ® . somewhat complex 

measure, two different techniques were adopted, une is s 

and is discussed in the ensuing subsection on question. 

other is quite straightforward and is based on the structured xoU.es q 

'^hich is presented in part on the following pace. i-.-^fieator to measure. 

It should be noted that this question pcrmi crlected value or goal 

^^iihin five approximate degrees of imporiancc. each selected 

of a given individual.





Most people have some idea of what they would want in an ideal position if
they could dream up a job which had all the elements they like.
would each of the following elements have in your ideal job? Put a check
A, B, C, D, or E

Response columns: Indispensable, Extremely Important, Very Important, Fairly Important, Rather

In my ideal job I would like an opportunity for:

   warm personal relationships with patients
B. Being virtually certain that my specific medical actions will lead to the desired
C. Having prestige among my colleagues in
   position to make a contribution to knowledge
E. Having the chance to help


When data from this question are combined with those from the structured
values question, measures of “expected value deprivation” may be
obtained. (The procedure involved will be taken up on p. 21.)

Finally, a preference-ranking question elicited ratings of interest
in a number of medical fields: *

How woktid yoo rortk iKe followlfta M<h o» occofdl^a •» V®'" 

worli«g In .h.m? (1 for iK. .peclolty ,p« would b. 

3 to 6 the ono» you ore leo« Inlerekted to wotlitng In) 


General practice 
Internal medicine 
Pathology 


Psychiatry
Public health 
Surgery 


This rating constituted the effect which the investigator was trying to
explain. The tools for its explanation had to do with the concepts of goals and
expectations.


Measurement and Scaling

Of the questionnaire's two techniques for measuring goals, the more elaborate
was the careers value question. Careers of four physicians representing
different medical fields were described, and students were asked to evaluate
each aspect of the different careers. In the following illustration of a surgical
career, students were asked to underline those sentences referring to something
they like, to cross out those sentences referring to something disliked,
and to make no mark where they had no definite feeling one way or the other.
CAREER B 

During your internship and residency you realized, far more concretely than you
had before, that a high level of manual dexterity was necessary for the field of
choice. Why was this so important? The answer was obvious. In major operations, a
slight mistake might prove fatal to the patient. However, the majority of operations
were purely routine matters, with similar techniques used over and over again. Still,
the emotional pressures involved were often extremely great.

After completing your residency, you took further work in a special field within
your specialty. You then set up practice in the city of WoVe, which has a population
of 1 million. In the ensuing years you have come to depend on the referrals of others
for much of your practice.

The service you give to your patients, the degree to which you are able to help,
is great. The changes for the better which you see are often quite striking. And, of
course, the esteem you command among laymen in general is very high.

Your working week comes to about 50 hours. Some of your time is spent in getting
out numerous hospital reports. Occasionally, some emergency work at unexpected
hours is necessary. The work usually involves a great deal of physical exertion, so

* The preference-ranking question actually called for students to rank dermatology and
obstetrics-gynecology in addition to the other six fields. However, because the expectations
question deals with only the six medical fields, no analysis was made with respect
to preference for dermatology and obstetrics.





much so that you are often completely exhausted. In addition, your work is often
emotionally taxing. Your yearly income fifteen years after your M.D. degree comes
to $25,000.

You have felt, with respect to therapy, that the accumulated knowledge which
could be brought to bear on your patients' illnesses has been great. Results have
usually been immediate and definite rather than drawn out and vague.


There were several reasons for using a question of this design. One was 
that students found it to be rather interesting, and it broke the monotony of 
filling out a very long questionnaire. The more interest a question has to the 
respondent, the less likelihood there is that he will rapidly fill in any kind of 
answer simply to finish the task. Another reason was the hope that the con- 
creteness of the descriptions of the different possible goals would be more 
meaningful to the student than the structured values question. Also, the 
careers value question can distinguish among those goals which are important
to the respondent (those that he either underlines or crosses out) and those
which are not. In the structured values question, on the other hand, respondents
are required to take a position even if they do not feel strongly about it
one way or the other. Finally, the question brings several different measures
to bear on each goal in a way which neither bores the respondent nor questions
his honesty.

It is the fact that several different items are used to measure the same goal
that enables the investigator to formulate a scale for each goal. A scale combines
scores on different but related items and yields a single over-all score
for each goal. Almost all the scales consist of three or four items, with each
item appearing in a different career description. Thus, for example, closeness
of patient relationships is represented in three separate career descriptions,
as follows:


1. During your years of practice, you have developed close and long-standing
relationships with many of your patients.

2. In the years you have been practicing, you have come to know most of your
patients and their families intimately.

3. Your contact with individual patients has been limited because in much of your
work you do not deal directly with them.

Individuals who underlined the first two sentences and crossed out the third
were given a score of 3, indicating the highest degree of interest in close relations
with patients. Conversely, individuals who did not mark any of the
sentences were scored 1, the lowest degree of interest. Individuals who
underlined one of the first two sentences or crossed out the third were scored
2, indicating a limited degree of interest, and those who indicated their interest
on two of the three sentences were also scored 2.* There were

**0 different response paftcni* were lumped lopcther so *"*V^*^^*f, ob- 
^ would be three. This » the same as ihe number cf ^ ,t tho<.ld 

la the especlasion* question. Wrth respect to the espectaiions q 



responses indicating dislike for close relationships. The
scoring procedure combines the data for the three different items into a
single scale based on all of them.
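
The scoring rule just described can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name and the use of sets of item numbers are hypothetical, and the treatment of marks indicating dislike (which the text does not spell out) is assumed to fall in the lowest category.

```python
def closeness_score(underlined, crossed_out):
    """Score the three-item 'close patient relationships' scale (1 to 3).

    underlined / crossed_out: sets of item numbers (1, 2, 3) marked by the
    respondent. Items 1 and 2 describe close relationships; item 3 describes
    limited contact with patients.
    """
    # Marks that signal interest: underlining item 1 or 2, crossing out item 3.
    interest_marks = len(underlined & {1, 2}) + len(crossed_out & {3})
    if interest_marks == 3:
        return 3  # highest degree of interest
    if interest_marks == 0:
        return 1  # lowest degree (assumption: also covers marks of dislike)
    return 2      # one or two interest marks: limited degree of interest


print(closeness_score({1, 2}, {3}))    # all three interest marks
print(closeness_score({1}, set()))     # a single interest mark
print(closeness_score(set(), set()))   # no marks at all
```

Collapsing the one-mark and two-mark patterns into a single score keeps the scale at three points, the same range used for the expectations scores.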


Analysis of Data 

The problem selected for analysis was to try to explain and predict medical
students' preferences for the different medical fields. These preferences were
expressed in responses to the preference-ranking question, in which students
were asked to rank six different fields of medicine according to their interest
in them. Although it would be useful to be able to understand and predict
effectively a student's exact rankings, such a goal is too lofty for present knowledge
in this area to fulfill. The tactics adopted involved the classification of
each ranking as “favorable” or “unfavorable,” depending on whether the rank
was 1-3 or 4-6, respectively.
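
As a small illustration of this dichotomy, the sketch below recodes one student's ranking of the six fields. The function name and the sample ranking are hypothetical and are not taken from the study's data.

```python
def classify_rank(rank):
    """Recode a preference rank (1 to 6) as 'favorable' or 'unfavorable'."""
    if rank not in range(1, 7):
        raise ValueError("rank must be an integer from 1 to 6")
    # Ranks 1-3 count as favorable, ranks 4-6 as unfavorable.
    return "favorable" if rank <= 3 else "unfavorable"


# Hypothetical ranking by a single student (1 = most interested).
hypothetical_ranking = {
    "General practice": 2,
    "Internal medicine": 1,
    "Pathology": 6,
    "Psychiatry": 4,
    "Public health": 5,
    "Surgery": 3,
}
ratings = {field: classify_rank(rank) for field, rank in hypothetical_ranking.items()}
print(ratings)
```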

The analysis centered on trying to understand why students rated a given
field favorably or unfavorably, and the two concepts utilized were values and
expectations. Because the two values questions produced consistent results,
it is enough here to present the analysis of data from the careers value question.

A rating of 3, for example, indicated the highest degree of interest in close
relationships with patients, whereas a 1 indicated the least degree of interest
in this goal. This same type of scoring was used for nine other values: opportunities
for learning, research, involvement with complex problems, utilizing
abilities, helping people, prestige among colleagues, high income, hours that
are not extremely long, and work without great physical exertion. Students
received ratings of 3, 2, or 1 on each of them, depending on their degree of
interest in each.

With respect to expectations for achieving various goals in the different
medical fields, the data were obtained from the expectations question. Students
were asked to rank six fields of medicine on each of a number of values. Thus,
for example, if value is placed on the chances for developing warm personal
relationships with patients, general practice might be ranked first, and pathology
might be ranked last. The scoring procedures on the careers value question
and on the expectations question were designed to have the same range: 1 to 3.
Thus, a score of 3 indicates the greatest chance for achieving a given value,
whereas a score of 1 indicates the least chance. Those fields seen by a given
student as offering neither greatest nor least chances for achieving a given
goal received a score of 2.
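
The recoding of expectation ranks into the 1-to-3 range can be sketched as follows. The function name and the `n_fields` parameter are illustrative assumptions; the rule itself (first rank scored 3, last rank scored 1, everything between scored 2) is the one stated above.

```python
def expectation_score(rank, n_fields=6):
    """Recode an expectation rank (1 = greatest chance) to a score from 1 to 3."""
    if rank not in range(1, n_fields + 1):
        raise ValueError("rank must be an integer from 1 to n_fields")
    if rank == 1:
        return 3  # field seen as offering the greatest chance
    if rank == n_fields:
        return 1  # field seen as offering the least chance
    return 2      # neither greatest nor least


print([expectation_score(rank) for rank in range(1, 7)])
```

Because the middle four ranks collapse into a single score, the recoded expectations share the same three-point range as the value scores, which is what allows one to be subtracted from the other.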

It would have been possible to conduct the analysis by separately examin- 
ing the relationship between occupational values and occupational prefer- 

be noted that there seem to be six possible ranks which may be accorded to any given
field as to the opportunities for a given value: 1, 2, 3, 4, 5, and 6. Actually, much of this
rnol TS* °t' ®f Ibe questionnaires in an effort to save time and cut 

*h» r«At, ®''3ilable information on which the study is based has to do with whether 

the rank is 1, 6, or somewhere between 2 and 5 inclusive.





ences, and between expectations and preferences. As a strategy for research 
this did not seem to lead to advancement of knowledge on the subject, because
other studies had already indicated that values and expectations are factors
involved in the occupational choice process. By putting the concepts of values
and expectation together in a specific way, however, one can glimpse the
beginnings of a theory which might add to our knowledge of the process.

The specific way of putting these two concepts together was to measure the
gap between what the student wanted in a career (his values) and what he
expected would be the opportunities for achieving these goals in a given
career (his expectations). This gap would be his expected value deprivation
for a given field. Thus, for example, Table 1-2 illustrates the situation of a


TABLE 1-2

PARTIAL PROFILE OF A STUDENT'S SCORES FOR
PATHOLOGY AND GENERAL PRACTICE

                                        Pathology     Pathology     General Practice  General Practice
                              Value     Expectations  Deprivation   Expectations      Deprivation
Values                        Scores (V)   (EP)        (V - EP)        (EG)             (V - EG)

Research                         3          3         3 - 3 = 0         1             3 - 1 = 2
Close patient relationships      1          1         1 - 1 = 0         3             1 - 3 -> 0

Expected value deprivation                                0                               2


student who is very interested in research (value ot 
interested in close patient relationships (value sco e . ^ jjjgj joj 

deprivation for pathology, where he XS Sma? 

research, is 0 . However, he perceives general P'“‘'rfL “ation tm general 
opportunities lor research, and his ex^t I* “ j, j, „ta,ed 

practice is 2 . The negative score (1 — 3) is no . .t. <.,udcnt is only 

L 0 (1 - 3 ^ 0) because it represents . aUairPrem™V! 

minimally interested in a goal be perceives y achievement 

if goals L not important to an Mivid-1. 1»^« 'Xei: in .h=. fid-.- 
in a given field would neither increase nor d . . for a student: 

It should be noted that Table 1-2 presents on y p . ^ expectations 

scores are presented for only two of the ten con- 

and deprivations are listed for only two of relationships, these values 

sideration. In addition to research and close p complex prob- 

were actually takcu imo account: learning, " f-S h rr.^a.n< 

in Section 17.3.




lems, utilizing abilities, helping people, prestige among colleagues, high in- 
come, hours not extremely long, and work without great physical exertion. 


TABLE 1-3

PROFILE OF A STUDENT'S TOTAL DEPRIVATION SCORES,
AND THE DEPRIVATION RANKS ASSIGNED TO
THEM FOR SIX MEDICAL FIELDS

Columns: General Practice, Internal Medicine, Surgery, Psychiatry, Pathology, Public Health

Rows: Total Deprivation Score (10 Values); Deprivation


In Table 1-3, the expected value deprivation score for a given field is
arrived at by summing the positive deprivations for all ten values. Each student
is given six such total expected value deprivation scores, one for each
of the medical fields. These scores are then ranked for each individual, with
the smallest total expected value deprivation being accorded a rank of 1, and
the largest 6. To the extent that the theory under consideration is a good
one, those students with the least expected value deprivation (the smallest
expected value deprivation ranks) for a given field would actually rate that
field favorably in answer to the preference-ranking question. On the other
hand, those students with the largest expected value deprivation (the largest
expected value deprivation ranks) for a given field would be expected to rate
that field unfavorably.
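
The ranking step can be sketched in Python as follows. The six totals below are hypothetical, not the figures from Table 1-3, and the handling of tied totals is an assumption: ties are simply broken by the order in which the fields are listed, which may differ from the procedure actually used in the study.

```python
def deprivation_ranks(total_evd):
    """Rank fields from smallest (rank 1) to largest (rank 6) total deprivation.

    total_evd: mapping of field name -> total expected value deprivation score.
    Assumption: tied totals keep their listing order (sorted() is stable).
    """
    ordered = sorted(total_evd, key=lambda field: total_evd[field])
    return {field: rank for rank, field in enumerate(ordered, start=1)}


# Hypothetical totals for one student, summed over the ten values.
hypothetical_totals = {
    "General practice": 7,
    "Internal medicine": 3,
    "Surgery": 5,
    "Psychiatry": 9,
    "Pathology": 1,
    "Public health": 11,
}
ranks = deprivation_ranks(hypothetical_totals)
print(ranks)
```

Under the theory, this hypothetical student would be expected to rate pathology (rank 1) favorably and public health (rank 6) unfavorably.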


TABLE 1-4

EXPECTED VALUE DEPRIVATION RANKS ASSIGNED TO MEDICAL
STUDENTS AND PERCENT OF STUDENTS WHO ACTUALLY
RATE FIELDS FAVORABLY

                                               Deprivation Ranks *
                                             1     2     3     4     5     6

Percentage who give favorable ratings       86    71    60    44    37    17
Percentage who give unfavorable ratings     14    29    40    56    63    83

Total                                      100   100   100   100   100   100

•n«d ra«l, or
•o">pl« «< J674 bi






Table 1-4 summarizes the basic findings of the study. The central hypothesis
is that medical fields are preferred to the degree that their expected value
deprivation is minimized. The left-hand entry under “Deprivation Ranks,”
for example, indicates that, of those individuals (1213) who are given an
expected value deprivation rank of 1 for any of the six medical fields, 86
percent rate that field favorably. The right-hand entry under “Deprivation
Ranks” signifies that 17 percent of those individuals (1844) who receive a
deprivation rank of 6 for a given field actually rate that field favorably. Thus,
if we know that a student receives a deprivation rank of 1 or 6 for a given
medical field, we are able to predict successfully whether or not he would
actually rate that field favorably a high proportion of the time. These data
provide confirmation for the central hypothesis of the study wherein the gap
between goals and expectations for fulfilling them is viewed as an important
factor in the processes leading to preferences among medical fields.

Theory involves putting together separate ideas or hypotheses in a systematic
way, and this often results in the putting together of many separate pieces
of data. The resulting chore of data processing can be immense, and this was
true of the present study. For each respondent, approximately 30 items of
information were utilized to secure ratings on the ten values in the careers
value question, or an average of three items for each value. With respect
to the expectations question, each student ranked the six medical fields on
each of the ten values, constituting a total of sixty rankings. Adding
to this the student's ranking of his actual interest in the six fields yields a
grand total of almost 100 items per respondent, or approximately 250,000
separate scores for a sample of size 2674.
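
The arithmetic behind these totals can be checked with a short calculation. The figure of three career items per value is the stated average, so the per-respondent count is approximate; the variable names are illustrative.

```python
n_values = 10
n_fields = 6
n_respondents = 2674

career_items = n_values * 3                # about three items per value (stated average)
expectation_rankings = n_fields * n_values # six fields ranked on each of ten values
preference_rankings = n_fields             # one interest ranking per field

items_per_respondent = career_items + expectation_rankings + preference_rankings
total_scores = items_per_respondent * n_respondents

print(items_per_respondent)  # close to 100 items per respondent
print(total_scores)          # on the order of 250,000 separate scores
```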

Mathematics and Theory 
“Theory” refers to a set of related statements or 
phenomenon. To the extent that the relationships among ***' ^ 

explicit it is possible to state theory In the language o ma .u.pfy more 
procedure is usually a valuable one. since it serves to ma 
systematic than previously and to provide more spcciicprc i :'t]„.trated 
The diflerenee between a ntore and a less caplieit theory 
We may start with the feneral idea that factors .f and f’.f' " “ “^a »a 
together, cause a change in C. This becomes more exp ici > ^ 

say that factor /4 produces the same degree of change 1 ^ 

factors A and D have no effect on one another. Exp proportional to 
creased still further by stating that the change m C is directly p po 

present fbmtalathrn we 

mathematical statement which combing the and 

tions into that of expected value deprivation iL . rvD and 

the mathematical statement which states the '* jtatetneot 

interest in a given medical specialty, on the other. Both HTCS 




are important for most mathematical formulations. In ^ 

ties together in a highly explicit fashion those factors which are , . 

as having an important impact on the student’s preferences among p ■ 

The second mathematical statement, which relates EVD to interest in a
given specialty, is crucial in that it makes use of the theory for purposes of
explaining and predicting preferences for the specialties.

More specifically, the first mathematical statement is based on a conception
of expected value deprivation, that is, the gap between the student's interest
in certain goals and his expectations as to the opportunities for achieving
them. In particular, the expected value deprivation (EVD) of a given individual
for a given medical field is the summation ($\sum$) of the difference between
his value scores ($V_i$) and his expectations for fulfilling the corresponding
values ($E_i$) over the set of ten values. Thus:


$$\mathrm{EVD} = \sum_{i} (V_i - E_i), \quad \text{where } V_i - E_i \text{ is positive}$$

In this formulation it is important to note that only those differences
which are positive (i.e., where value scores are greater than their corresponding
expectation scores) enter into the summation. In these situations something
is desired but is conceived of as relatively unavailable, and this seems
to be the kind of context in which it is most meaningful to speak of expected
value deprivation. Where something is not desired but is available, for example,
the concept of expected value deprivation does not seem appropriate.
Thus, where $V_i - E_i$ is negative or 0, nothing is contributed to expected value
deprivation.
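
A minimal sketch of this summation, using the two values shown in Table 1-2 rather than the full set of ten. The function name and dictionary layout are illustrative assumptions, and the scores are as they appear to read in that table.

```python
def expected_value_deprivation(values, expectations):
    """Sum the positive gaps between value scores and expectation scores.

    values / expectations: mappings of goal name -> score from 1 to 3.
    Gaps that are zero or negative contribute nothing to the total.
    """
    return sum(max(0, values[goal] - expectations[goal]) for goal in values)


# Scores for the student illustrated in Table 1-2 (two of the ten values).
values = {"research": 3, "close patient relationships": 1}
pathology = {"research": 3, "close patient relationships": 1}
general_practice = {"research": 1, "close patient relationships": 3}

evd_pathology = expected_value_deprivation(values, pathology)
evd_general_practice = expected_value_deprivation(values, general_practice)
print(evd_pathology, evd_general_practice)
```

For general practice the research gap (3 - 1 = 2) counts toward the total, while the negative gap for close patient relationships (1 - 3) is treated as 0, which reproduces the totals of 0 and 2 discussed for this student.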

As for the second mathematical statement, it is based on the idea that the
greater the individual's expected value deprivation for a given field, the greater
is the likelihood that he will not prefer that field over others. This may be stated
symbolically as follows:


$$\mathrm{EVD}_{f,s} = k \cdot p(\text{unfavorable rating})_{f,s}$$

This reads “The expected value deprivation for a given field (f) by a given
student (s) is equal to a constant (k) times the probability of an unfavorable
rating of the field by the student.” More succinctly, degree of expected value
deprivation is directly proportional to the probability of an unfavorable rating.

Such mathematical formulations as this are most useful when they can be
tested by actual data. From Table 1-4, we have these data:

Deprivation Ranks                           1    2    3    4    5    6

Percentage who give unfavorable ratings    14   29   40   56   63   83

If a student's EVD for a given field is higher than for all other fields, he will
receive a deprivation rank of 6 for that field. If, on the other hand, his EVD 
for a field is lower than that for other fields, his deprivation rank will be 1. 

From these data we can indeed see that the greater the individual's EVD for
a given field (relative to other fields), the greater is the likelihood that he will
rate that field unfavorably. Only 14 percent of students who had a minimum
of expected value deprivation for a given field rated that field unfavorably,
whereas 83 percent of those having a maximum of EVD for a field rated it
unfavorably. The degree of EVD is in fact related to the percent rating a field
unfavorably, and this percent can in turn be interpreted as the probability that
a given individual will rate a field unfavorably. Further support for a mathematical
formulation can be seen in the fact that the percentages increase in
approximately equal amounts for each change in the deprivation rank. This
is an indication that the relationship between deprivation rank and the percentage
of students giving unfavorable ratings is one of direct proportionality.
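
One way to look at this claim more closely is to compute the step between adjacent ranks and to fit a straight line to the six percentages. The sketch below uses an ordinary least-squares fit, which is a technique brought in here for illustration and not one described in the study; note also that it works with deprivation ranks, not with the underlying EVD scores.

```python
ranks = [1, 2, 3, 4, 5, 6]
pct_unfavorable = [14, 29, 40, 56, 63, 83]

# Increase in the percentage from each rank to the next.
steps = [b - a for a, b in zip(pct_unfavorable, pct_unfavorable[1:])]

# Ordinary least-squares line: pct = intercept + slope * rank.
n = len(ranks)
mean_x = sum(ranks) / n
mean_y = sum(pct_unfavorable) / n
slope = (
    sum((x - mean_x) * (y - mean_y) for x, y in zip(ranks, pct_unfavorable))
    / sum((x - mean_x) ** 2 for x in ranks)
)
intercept = mean_y - slope * mean_x

print(steps)
print(round(slope, 2), round(intercept, 2))
```

The steps range from 7 to 20 points, so "approximately equal" should be read loosely. The fitted line rises by roughly 13 points per rank with an intercept close to zero, which is what direct proportionality would require.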


Future Research 

There are many different ways in which research can proceed from this
point. One of them has to do with a broadening of the formulation to include
expected value fulfillment as well as expected value deprivation, and this will
be described in Chapter 17. In such a conception man is seen to be seeking
gratification as well as avoiding deprivation.

Another direction for research has to do with the development of precise
techniques for measuring values and expectations. For example, the values
utilized in the medical student study were accorded equal weight, although it is
entirely likely that some of them are many times more important than others
for most medical students. Also, other values than those on the
list might be highly relevant for many students. Some
lead to improvements in measuremeot are discussed in . ,„rvgy 

Techniques of data collection other than that of the questionnaire survey
might shed additional light on the theory. For example, the investigator might
attempt to “simulate” occupational choice by constructing a game situation
which bears certain important correspondences to the situation
within which occupational choice takes place. This technique is discussed
in Section 3.2 as well as in Chapter 9.

To the extent that the formulation is found to work well within the context
of occupational choice, efforts might be directed at testing it in
other areas. One might succeed in defining much of human behavior as constituting
choices and then seek to discover the utility of this formulation
for achieving comprehensive explanations and accurate predictions over a wide
range of behavior.

SELECTED REFERENCES FOR FURTHER STUDY 

Bernard Berelson and Gary A. Steiner, Human Behavior: An Inventory of
Scientific Findings (New York: Harcourt, Brace & World, 1964). This is a
collection of 1,045 findings from the scientific study of human behavior, with
emphasis on sociology and psychology. Although it does not present
all of the available knowledge of human behavior, its findings
cannot be ignored in any general explanation of human behavior.

Phillip E. Hammond (ed.), Sociologists at Work: Essays on the Craft of Social
Research (New York: Basic Books, 1964). This volume presents in
considerable detail the personal history associated with a number of research
projects. This is the kind of information which is only rarely found in the
literature. It personalizes the research process by portraying it as
carried on by human beings seeking answers to questions and having to cope
with obstacles.

John Madge, The Origins of Scientific Sociology (New York: The Free Press of
Glencoe, 1962). Major researches from the 1890's to the 1950's are presented
here in considerable detail, thus providing an historical perspective on developments
in the field. Within the context of each study, the author illustrates fundamental
ideas about scientific method and theory.





Elements of Inquiry 


2.1 THE LANGUAGE OF SCIENCE

The Ladder of Abstraction

The elements of scientific inquiry are concepts, propositions, and theories.
Concepts are ways of perceiving phenomena; propositions are statements
about the nature of phenomena; theories are systems of propositions. The
three together may be viewed as constituting the language of science. One of
the essential features of any language is that it abstracts or selects
certain characteristics of the reality to which it refers. This selectivity endows it
with wide scope, i.e., a capacity to refer to a wide range of phenomena.
The concept of eight things, for example, is highly selective and has wide
scope; it selects only that which eight people and eight planets
have in common, which is little enough; yet because of this selectivity, the
concept can be applied to a very wide variety of phenomena.

Figure 2-1 presents a “ladder of abstraction” for concepts.¹ As concepts
become more selective or abstract, they apply
to more and more phenomena. Scientific concepts, such as the choice behavior
of human beings, constitute abstractions which are highly selective and which have a
high degree of scope. They enable the individual to adjust to the environment,
but they have a special, related function for the scientist: they are the
elements utilized in statements designed to explain and predict phenomena.

¹ This idea of drawing a ladder of abstraction is taken from S. I. Hayakawa, Language in Thought and Action (New York: Harcourt, Brace, 1949), p. 169.

The concepts in Fig. 2-1 ascend in level of abstraction from John Smith's
answer to a question about his preferences among occupations to the choice
behavior of human beings. The less abstract the concept, the closer it is to
actual measurement procedures. More abstract concepts, such as
the choice behavior of human beings, can also be measured, but because of
their very wide scope, only a small proportion of the total situations
to which they apply can be investigated by the scientist. This is the price
exacted for such scope, but the return may be great. The concept of choice
behavior may become an element of a highly effective theory for explaining
and predicting human choices. If indeed it does, this will constitute an
important return, since the availability of such a general theory in the social
sciences would be analogous to important breakthroughs in the physical
sciences.

The choice behavior of human beings.

The choice behavior of living human beings.

The choice behavior of living Americans.

The choice behavior of John Smith.

The choices which John Smith makes during a series of interviews, as well as under periodic observations, over a period of one year.

John Smith's answer to a question about his preferences among occupations.

FIGURE 2-1. A Ladder of Abstraction for Concepts.

It is one thing to put together sense data in a certain way so as to come up
with the concept of chair. It is quite another to state John Smith pushed the
chair and it fell on its side. Here one is not merely adopting a particular convention
for viewing phenomena but utilizing concepts to
describe something that is happening. Concepts are elements of statements,
but they are not statements in themselves. A concept is neither
true nor false, for it is only a designation or naming. Statements, however, can
be true or false, and the statements made by scientists concerning phenomena can
be tested. Although scientists do not attempt to ascertain
the nature of reality with certainty, sufficient evidence may be obtained to make such
statements highly credible.

Figure 2-2 presents a ladder of abstraction for propositions and theories.
Like the ladder for concepts, its lower rungs deal with a narrower
range of situations. The statements rise in level of abstraction from attempts to
explain and predict John Smith's preferences among medical fields to attempts
to explain and predict the choice behavior of human beings, with the choice
behavior of human beings covering much more territory. Such a formulation
selects very little from the ongoing phenomena to which it refers, and, by this
token, is highly abstract; its scope, however, is so wide that it can be applied
to much of the phenomena with which behavioral scientists are concerned.

Any given human being chooses that alternative which has the lowest expected value deprivation for him.

Any given living human being chooses that alternative which has the lowest expected value deprivation for him.

Any given living American will choose an occupation which has the least expected value deprivation for him. He will rate unfavorably that occupation which has the greatest expected value deprivation for him.

Any given medical student will rate favorably that medical field which has the least expected value deprivation for him and will rate unfavorably that medical field which has the greatest expected value deprivation for him.

A medical student's preferences among medical fields are affected by his values and his expectations as to the opportunities for fulfilling these values in the various fields.

John Smith's preferences among medical fields are affected by his values and his expectations as to the opportunities for fulfilling them in the various fields.

REALITY

FIGURE 2-2. A Ladder of Abstraction for Propositions.

It should be noted that the statements at the lowest two rungs of the ladder
in Fig. 2-2 are sets of two propositions which are not put together in any
systematic way. The two factors of values and expectations as to the opportunities
for fulfilling them in the various fields are cited as affecting preferences
among medical fields. Just how these factors affect such preferences is
left unspecified. The other rungs of the ladder, however, fill in these gaps by
predicting that the alternative chosen is the one with the lowest expected
value deprivation. Such statements might be called theories, for they put together
several propositions in a systematic way.

Two major aspects of scientific orientation are implied by this ladder:
skepticism and objectivity. If statements about reality cannot be proved conclusively,
then any scientific “truth” is subject to revision on the basis of
additional observation. A skeptical attitude toward scientific data and conclusions
is valuable in that it keeps the scientist's mind open to new observations
and ideas which might overturn the old.

Scientific objectivity refers to attitudes and procedures which make it possible
for results to be checked by others within the scientific community. Thus,
for example, there is considerable agreement as to what shall constitute evidence
for a given proposition. If the reality can never be definitely ascertained,
the best that fallible humans can do is to check their observations and conclusions
with one another. If these procedures do not yield knowledge which is
certain, they at least help to achieve credible knowledge.

Of course, decisions may be made on the basis of tradition or the advice
of some authority figure without reference to the available evidence as defined
by the scientific community. Such non-scientific orientations are commonplace
in the private lives of scientist and non-scientist alike. Scientific and non-scientific
orientations are aspects of two conflicting belief systems about the
nature of effective procedures for explanation and prediction, and one might
say that each orientation fundamentally rests on faith in its own future effectiveness.
The scientist cannot prove that the scientific method will work in the
future, but he can at least cite the past successes which may be attributed to it.
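
The theory-level rungs of Fig. 2-2 state a definite rule: the alternative chosen is the one with the lowest expected value deprivation. A minimal Python sketch of that rule follows. The EVD scores and field names are hypothetical placeholders (how EVD is computed is not shown here), and returning every tied alternative is an assumption of the sketch, since the text does not say how ties are resolved.

```python
from typing import Dict, List


def predicted_choices(evd_by_alternative: Dict[str, float]) -> List[str]:
    """Return the alternative(s) with the lowest expected value deprivation (EVD).

    More than one alternative is returned when scores tie; how ties should be
    resolved is not specified in the text, so this is an assumption.
    """
    if not evd_by_alternative:
        raise ValueError("at least one alternative is required")
    lowest = min(evd_by_alternative.values())
    return sorted(name for name, evd in evd_by_alternative.items() if evd == lowest)


# Hypothetical EVD scores for one student; lower means less deprivation.
john_smith_evd = {"surgery": 4.0, "pediatrics": 1.5, "psychiatry": 2.5}

chosen = predicted_choices(john_smith_evd)
```

By contrast, the two lowest rungs of the ladder name the relevant factors but give no rule of this kind, which is why no comparable function could be written for them.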



Single-Level Abstraction

Single-level abstraction in ordinary speech results in the
stringing together of vague generalities (the top of the ladder) or in the dull
exposition of highly specific reports (the bottom of the ladder).
Such procedures also have their effects on the scientific process, and
some of these are discussed by C. Wright Mills:

The basic cause of grand theory is the initial choice of a level of thinking so
general that its practitioners cannot logically get down to observation. They
never, as grand theorists, get down from the higher generalities to problems
in their historical and structural contexts. This absence of a firm sense of
genuine problems, in turn, makes for the unreality so noticeable in their
pages. One resulting characteristic is a seemingly arbitrary and certainly
endless elaboration of distinctions, which neither enlarge our understanding
nor make our experience more sensible.²

² C. Wright Mills, The Sociological Imagination (New York: Grove Press), p. 33.

In addition to taking to task “grand theory,” which remains at the top of
the ladder of abstraction, Mills also denounces “abstracted empiricism,”
where, by staying as close as possible to the bottom of the ladder, both rigor
and triviality are simultaneously achieved.

Mills' highly individualistic conception of research reflects his concern
over the “bureaucratization of research.” He wants each social scientist to
be his own theorist and his own methodologist, moving investigations
back and forth between the formulation and testing of theory. By
posing the problems of science in terms of the
individual, however, one tends to lose sight of the fact that scientific progress
seems to depend on a community of scientists, the members of which build
upon one another's work. If a particular scientist develops theory and
neglects verification, scientific progress may not be endangered provided
that others are stimulated to test the theory.

Of course, Mills might still maintain that the quality of theory tends to
suffer when the process of verification is not salient in the mind of the investigator.
The reverse might be said about a lack of attention to theory: that
the triviality of many findings in behavioral science stems from a lack
of interest in theory. At any rate, Mills takes a very strong position in favor
of a particular research strategy and against other strategies.
It must be admitted, however, that there is as yet little evidence within behavioral
science for this contention. The nature of the verification process
constitutes one of the major distinctions between scientific investigation and
other attempts to arrive at an understanding of phenomena. In non-scientific






orientations, “verification” is achieved through reference to
the statements of “authority figures.” The scientist, on the other hand, attempts
to verify his ideas by submitting them to members of the community
of scientists for testing according to the rules which
are generally accepted within this community. His procedures are objective,
i.e., the detailed steps involved can be repeated and checked by any other
member of the community of scientists.

This testing process also provides an important distinction between the
method of the scientist and that of the student of the humanities.
The poet, the musician, the novelist, or the playwright may obtain
quite profound ideas or insights into various aspects of human behavior.
It is the scientist, however, rather than the student of the humanities, who
is concerned with the objective testing of these ideas or insights.³

³ For a readable perspective on some of the relationships between the humanities and the sciences, see C. P. Snow, The Two Cultures: And a Second Look (New York: The New American Library, Inc., 1964). See also Lewis A. Coser (ed.), Sociology through Literature (Englewood Cliffs, N.J.: Prentice-Hall, 1963).

The verification process plays an important role in all the sciences, but they
differ greatly in the degree to which they have actually validated propositions
and theories at a high level of abstraction. It is common knowledge that the
physical sciences have developed to a far greater degree in this respect than
the social sciences. Nevertheless, most behavioral scientists feel that the
understanding of human behavior can be greatly increased.


2.2 CONCEPTS

Clarity and Scope

Concepts, as has been explained, are abstractions used by the scientist as
building blocks for the development of propositions and theories which explain
and predict phenomena. They are unitary and thus are not themselves
relationships between phenomena. For example, the behavioral scientist ordinarily
would not speak of “the process of occupational choice” as a concept,
but would refer to “choice” and “values” as concepts.

A concept represents a selection of certain phenomena which are grouped
or classified together. Such a selection is neither true nor false; rather, it is
judged in terms of its utility for the advancement of scientific knowledge. Its
degree of freedom from vagueness and ambiguity, i.e., how precisely it specifies
or connotes a particular set of phenomena (its determinacy) and how
uniform its usage is for all users of the language (its uniformity of usage),
will also affect its value for the communicative process.

Clarity is greatly affected by whether or not the terms used to designate the
concept are part of the lay vocabulary. If they are, as is usually the case with
concepts in behavioral science, the meanings of the terms as used in everyday
speech may obscure the meaning which the scientist is attempting to convey.
For example, one definition of the concept “role” by behavioral scientists is a
pattern of behavior associated with a distinctive social position. In the lay
vocabulary, the term social position connotes the place of individuals in a
social-prestige hierarchy. In the vocabulary of behavioral science, social position
usually means a social category such as father, wife, engineer, and so on.

Lack of clarity may lead to statements which seem to provide explanations
but which in fact do not. The following paragraph was constructed to illustrate
this point.

Psychopathic personality has been defined in many different ways. One example
makes reference to the characteristic of emotional abnormality without
the break from reality which characterizes psychotics. Another refers to the
lack of a fully developed conscience. A third definition refers to
antisocial acts and criminal behavior. We will accept this latter definition.

We come now to one of the basic questions which the student of human
behavior is called upon to answer: What are the causes of crime? Although
these are many, one of the most important ones has to do with the personality
structure of the individual. As we all know, crime is by definition a
violation of the code of society, and it is the personality
unchecked by a socially developed conscience which feels no compunctions
about violating social norms. Thus, a partial explanation for the occurrence
of crime is that it is in large measure a product of the psychopathic personality.

Psychopathic personality, defined as antisocial acts and criminal behavior
in the first part of the paragraph, is used in the second part of the
paragraph as a partial explanation for the occurrence of crime. This amounts
to saying that criminal behavior is one of the causes of criminal behavior. The
unwary reader, however, may read the term psychopathic personality as meaning
something different from its definition in terms of criminal behavior. Because
he adds meanings to the term other than those implied by the author, the
reasoning does not appear to him to be circular.

A critical approach to concepts and definitions is consonant with the tradition
of scientific skepticism. The different kinds of criteria for concept assessment may
be classified into three: scope, clarity, and systematic import. Scope refers to
the inclusiveness of the class of situations to which the concept applies (concepts
at the higher rungs of the ladder of abstraction have greater scope than
those at the lower rungs); clarity, to the concept's potential for moving down
the ladder of abstraction to an objective determination or measurement; and
systematic import, to the degree to which the concept is utilized in propositions
and theories. Both the scope of the propositions and their degree of
clarity affect systematic import.



One chapter in a leading textbook introduces terms such as
self, socialization, instinct, symbolic interaction, I, me, id, ego, superego,
looking-glass self, static adaptation, dynamic adaptation, adequate ego,
identification, significant others, natural sign, non-verbal communication,
generalized other, other-directed, inner-directed, anticipatory socialization,
basic personality, national character, and Oedipus complex. These
concepts all have considerable scope because they can apply to situations
anywhere and any time. Some of them, e.g., non-verbal communication,
have more scope than others, e.g., anticipatory socialization. Whereas the
former concept can be applied to almost all situations in which interaction
is taking place, the latter refers primarily to those situations in which
the individual's actions help him to prepare for a future change in role.

Clarity has to do with the degree to which the concept implies a chain of
lower-level abstractions. For example, the concepts good and
beautiful mean very different things to different individuals. It is difficult for
each to move down the ladder of abstraction to instances in which the phenomenon
does and does not occur. Of course, no concept means exactly the
same thing to different individuals, and a certain amount of vagueness may
even be useful, for it leaves the concept open to many applications. For
example, social interaction may be defined as the process in which one person's
behavior affects another person's behavior. But, in a sense, everyone's
behavior is affected, however slightly or indirectly, by the behavior of every
other person. Should we then limit interaction to face-to-face situations? This
would exclude the interaction between an author and a reader or between
proponents of an ideology and the masses of followers. Thus, sometimes it
would be useful to investigate social interactions which are not face-to-face.
It might even be useful to study indirect interaction through a number of
intermediaries. The vagueness of the concept interaction, as defined above,
provides a “tent” within which all of these investigations may take place.


Systematic Import

In addition to the criteria of scope and clarity for the evaluation of concepts
we have that of systematic import. In the passage below, this criterion is pitted
against that of formulating precise “operational definitions,” with the latter
being closely related to the criterion of clarity.

In the contemporary methodological literature of psychology and the social
sciences, the need for “operational definitions” is often emphasized to the
neglect of the requirement of systematic import, and occasionally the
impression is given that the most promising way of furthering the growth
of sociology as a scientific discipline is to create a large supply of “operationally
defined” terms of high determinacy and uniformity of usage,
leaving it to subsequent research to discover whether these terms lend themselves
to the formulation of fruitful theoretical principles. But concept
formation in science cannot be separated from theoretical considerations.⁴

⁴ Carl G. Hempel, Fundamentals of Concept Formation in Empirical Science (Chicago: University of Chicago Press, 1952), p. 47.

In this view, concepts should not be treated separately from the propositions
and theories of which they are a part. If concepts are to be evaluated in
terms of their contribution to explanation and prediction, then their role as
elements of existing propositions and theories (the major tools for explanation
and prediction), i.e., their systematic import, must be taken into account.

This criterion is certainly not an infallible guide to the evaluation of a given
concept. New concepts, for example, may have very little systematic import
and still ultimately achieve a great deal. Heavy reliance on systematic import
would yield a very conservative research strategy; neglect of systematic import
would result in a failure to learn from previous research. If there are few
verified propositions with wide scope in a given field and, as appears to be
the case in behavioral science, few systems of such propositions, it is
very difficult to reach any agreement as to which concepts have systematic
import and which do not. In this situation almost all concepts have little
systematic import, and their value lies primarily in their potential for the
development of systematic import. When objective criteria, such as systematic
import, cannot readily be applied, the way is then open for other
criteria, such as the authority of the person using the concept,
the journal in which the publication appears, the non-systematic association of
the concept with others in vogue, the degree to which the concept involves
words or grammatical constructions uncommon in the lay vocabulary, the
mere assertion that the concept is an important one, and so on.

The extent to which scientists may be guided by these other criteria
has been demonstrated by a tongue-in-cheek article which illustrates
the effect of these criteria. One excerpt, in which a theory of “deumbilification”
is presented, reads as follows:

This constant contact (between fetus and umbilical cord) . . . expectations,
through conditioning, of further contact. . . . umbilical cord, an awareness
of its absence is manifest. . . . after parturition, in all societies and peoples
the umbilical cord is removed from the new born, either by cutting, biting,
or letting it . . . naturally, it is a normal course for the . . . to be
deumbilificated. The absence of the umbilical cord . . . traces
associated with it, are the underlying reasons for the insecurity manifest in
man.⁵

⁵ Edgar F. Borgatta, “Sidesteps toward a Nonspecial Theory,” Psychological Review, LXI (1954), pp. 345-6.



Borgatta uses all possible ploys, and then some, to promote the concept of
deumbilification: its extensive scope is emphasized, the work of well-known
sociologists is footnoted, new terms are introduced (deumbilification,
person-group, and others), numerous words are underlined, well-known theories
and methods are constantly evoked, and unusual grammatical constructions and
phraseology are employed. The
result was that many professional social scientists took the article seriously
and at face value. This, perhaps, should serve to emphasize the value
of scientific skepticism as well as the fact that many scientists are not well
versed in utilizing scientific skepticism.


Extreme Operationism and Operationism

Behavioral scientists vary in the degree to which they emphasize the clarity
of a given concept and the ease of developing objective measurements
of that concept. The tendency of some behavioral scientists to regard clarity
and ease of objective measurement as the most important criteria for concept
formation may be referred to as the strategy of extreme operationism. Under
this approach, a concept such as “goal” might not pass muster because, regardless
of its relatively high degree of systematic import, it is more difficult
to measure than many others. Another kind of concept which would be rejected
is exemplified by Max Weber's ideal types:

An ideal-type is formed by the one-sided accentuation of one or more points
of view and by the synthesis of a great many diffuse, discrete, more or less
present and occasionally absent concrete individual phenomena, which are
arranged according to those one-sidedly emphasized viewpoints into a unified
analytical construct. In its conceptual purity, the mental construct cannot
be found empirically anywhere in reality. It is a utopia.⁶

⁶ Max Weber, “Objectivity in Social Science and Social Policy,” in Edward A. Shils and Henry A. Finch (trans. and ed.), The Methodology of the Social Sciences (New York: The Free Press of Glencoe, 1949), p. 90.

The purpose of setting up ideal types is to help the investigator develop
concepts which may prove to have systematic import. Thus, for example, an
ideal type might be the “urban personality,” conceived of as having all of the
characteristics associated with cosmopolitanism, urbanity, and sophistication,
and none of the characteristics conflicting with these. Although such personalities
do not actually exist, this conception enables the scientist to talk about
the various degrees of “urbanity.” He can then treat degree of urbanity as a
variable and proceed to relate it to other variables, thus testing propositions
and theories.

The ideal type is an example of a concept referring to phenomena which
are not directly observable. The word construct is often used to denote such
phenomena. Historically, many constructs (such as perfect vacuum) have



The Process of Inquiry


3.1 GOALS OF BEHAVIORAL SCIENCE

Means and Ends

Concepts, propositions, and theories provide the structure or content of a
science, but these ideas are developed and tested by individuals who have
certain goals and certain investigative strategies and tactics. It is this
dynamic process of investigation which must be understood if one is to achieve
any adequate comprehension of science. In looking at science as a process
carried forward by individuals, our major point of reference will be the goals
that scientists as a group are trying to achieve.

These goals were implied during the treatment of the language of science: concepts,
propositions, and theories (see Chapter 2), and it remains to spell out these
implications. Concepts were defined as abstractions which are used by the
scientist for the development of propositions which explain and predict phenomena;
propositions were defined as statements about the nature of reality;
and theory was defined as a set of propositions which are systematically
related. From these definitions, the goals of science seem to involve the nature
of reality, prediction, and explanation. These three ideas will be discussed in
succession.

The Nature of Reality

In Chapter 2 a framework was constructed wherein the statements of
science as to the nature of reality are viewed as being fundamentally uncertain.
Scientists have learned to live with this uncertainty. The scientific community
seems useful in coping with this uncertainty, because it at least substitutes a
collective estimate of the nature of reality for an individual one. Each scientist
sees the world somewhat differently from every other scientist, and
agreement among them is viewed as providing an improved estimate of
reality. Of course, the community of scientists as a whole may be wrong; this
has happened often enough in the history of science.



s^c. i.;i 


Goals of Behavioral Science 


measuring instruments limit the accuracy of predictions. Furthermore, many 
aspects of the physical world have not yet yielded to precise predictions (a 
classic example is the weather). When we move from the macroscopic to the 
microscopic world, our capacity for prediction is drastically reduced. 

It is also possible to pr^ict accurately certain aspects of human behavior; 
if it were not possible, life as we know it could not continue for very long. 
Who would dare to drive a car if he did not have confidence in the prediction 
that there is very little chance that one of the hundreds of thousands of drivers 
he will meet during his lifetime will deliberately seek to collide with him 
head-on? We can also expect, with a high degree of confidence, that our 
close friends will not betray us, that our currency can be exchanged for goods 
ind services, that we will be taxed periodically, and so on. 

la addition to the regularities that arc obvious to most individuals, social 
scieafisls have unearthed many regularities that arc or once were not so obvi- 
ous. There are, for example, the uniform patterns of behavior, attitudes, 
values, and social interaction exhibited by the members of a pven social class, 
* liven family, or a given sodety.* 

As in the case of the physical scientists, all predictions by social scientists
are fundamentally uncertain and limited in accuracy. If we think of predictions
in terms of probability rather than certainty, it would be correct to say
that predictions in the social sciences generally have a lower probability of
success than those in the physical sciences. We can predict, for example, that
contact between a majority-group member and a minority-group member will
tend to reduce the degree of prejudice of the former with respect to the latter.
Such a “tendency” prediction will prove to be correct little more than 50
percent of the time, for many other factors that affect the reduction of
prejudice are not taken into account. Some of these are the relative social
statuses of the two members, the normative expectations of the majority-group
member's “significant others,” the degree to which the minority-group member's
actions in the contact illustrate the negative stereotypes of his group, whether
the contact is socially defined as intimate or impersonal, and the frequency
and duration of the contact.

One of the reasons why the difference in predictive capacity between the
physical and social sciences is so keenly felt is that the physical sciences have
led to a successful technology which is almost universally regarded as having
contributed enormously to man's welfare. The lack of a comparable technology
in the social sciences is appallingly obvious, whether we consider
intergroup relations, international politics, crime and [...]
illness, self-realization and creativity, family relationships,
[...] goods and services, or organization and control of power.

There have, of course, been many efforts to achieve the kinds of predic-

* Bernard Berelson and Gary A. Steiner, Human Behavior: An Inventory of Scientific
Findings (New York: Harcourt, Brace & World, 1965).


THE PROCESS OF INQUIRY

Which aspects of reality are behavioral scientists concerned with?
The broad emphasis is on that kind of behavior which is shaped in different
ways by the environment and is not predetermined by heredity. We
might add here that they are not directly concerned with behavior
that is predetermined by physical laws. Thus, what happens after slipping
on a banana peel or jumping out of an airplane is governed by
physical laws, not human choice. The behavioral scientist, of course, is
interested in why an individual comes to step on a banana peel or jump out of a
plane. These are types of behavior which are not predetermined biologically
or by physical laws; that is, they are a matter of choice.

Choice is not a static phenomenon; rather, it involves a change or shift
from one state to another. We may speak advisedly of the choice process,
thus implying that there is a sequence of events which leads to a given choice
or a series of choices.

The scientist seeks to find order amid such sequences, because he generally
believes that the nature of reality is based on order rather than chaos and
that much of this order has to do with cause-and-effect relationships among
phenomena. He realizes that he cannot scientifically prove with certainty that
any given relationship among phenomena is one of cause and effect. Because
of this he may not even utilize the terms cause and effect in his formulations;
instead he talks about relationships among variables. Nevertheless,
cause-and-effect relationships among phenomena that are involved in human
choice processes seem to constitute the major focus of behavioral science.
(A further discussion of the causal process will be presented in Section [...].)

The focus of behavioral science is closely related to human goals. Of course,
these goals vary among individuals. The goals of the scientific community, if
they include all the goals of all of the scientists, would deal with many things
that are unrelated to the scientific enterprise. We might instead speak of a
general consensus within this community as to the goals of their scientific
efforts. Although there would be considerable disagreement, we might name
two broad classes of goals: the prediction of phenomena, and the explanation
of the nature of reality.


Prediction 


Scientists are interested in prediction in order to be able to cope effectively
with their environment. Precise predictions constitute one of the most
impressive achievements of the scientist. The astronomer has been able to predict
accurately the positions of the planets in our solar system many years ahead
of time. The physicist's knowledge of the relationship between distance fallen
and time in free fall of a body in a vacuum can be used together with other
physical laws to predict the trajectory of a missile. There are limits to the
accuracy of these predictions, however. Fundamentally, there is no guarantee
that the scientist’s predictions will actually come to pass. In addition, available



[Ch.3 


tions on which such a technology might be based. Information collected by
penal institutions in New Zealand was the basis for Albert Morris' attempt to
develop an index to predict which individuals would escape from these
institutions.* (The method he used follows the general pattern of that used by
the Gluecks.†) After investigating a number of factors that might differentiate
absconders from non-absconders, he selected six factors that seemed to so
differentiate most effectively: age, marital status, present offense, previous
offenses, [...], and number of previous escapes. A total score for a given
individual was then computed on the basis of these factors.

These total scores were then compared with the records of escapes.
This is not prediction as the word is ordinarily understood. The comparison
merely indicates how well the total scores could have predicted
escapes if they had been collected before the escapes. Nevertheless, such
information is quite valuable for the development of effective predictions of
future events. The results of these comparisons were twofold: (1) with
respect to those individuals who actually did escape, 88 percent had total
scores indicating a high escape risk; and (2) in spite of this accuracy, so
many individuals received total scores indicating a high escape risk but
actually did not escape that such scores would have been ineffective for
prediction of escape tendencies. In other words, the scores would have provided
excellent predictions for the small number of individuals who actually did
escape. But so many other individuals who subsequently did not escape
received the same total scores that the total mixture of scores indicating
escapes would have been quite inaccurate. As a matter of fact, only 8 percent
of all those individuals for whom escapes were predicted actually did abscond,
while 92 percent did not. (The latter might be called the “false positives,”
since erroneous predictions of escapes would be involved.)

Predictions often must be quite accurate to be of genuine utility. The study
of absconders provides an example of the development of a rather good index
for discriminating between absconders and non-absconders, but it was not
sufficiently selective for the demands of the practical situation. In general, it
is extremely difficult to predict occurrences that are relatively rare.
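
The two percentages reported for the absconder index can be reconciled with a
little arithmetic. What follows is only a minimal sketch in Python, not a
reconstruction of Morris' procedure. The four counts are hypothetical, chosen
solely so that they reproduce the 88 percent and 8 percent figures quoted
above, and the function name `screening_summary` is likewise an assumption
made for illustration.

```python
def screening_summary(true_pos, false_neg, false_pos, true_neg):
    """Summarize how a 'high escape risk' score compares with actual escapes."""
    escaped = true_pos + false_neg           # everyone who actually escaped
    flagged = true_pos + false_pos           # everyone scored as high risk
    total = escaped + false_pos + true_neg   # the whole population
    return {
        "base_rate": escaped / total,
        "share_of_escapers_flagged": true_pos / escaped,
        "share_of_flagged_who_escaped": true_pos / flagged,
    }


# Hypothetical counts for 1,000 individuals (NOT taken from the study):
# 25 escape, 22 of them with high scores; 253 others also score high.
summary = screening_summary(true_pos=22, false_neg=3, false_pos=253, true_neg=722)

for name, value in summary.items():
    print(f"{name}: {value:.3f}")
```

Under these assumed counts only 25 of 1,000 individuals escape, so even a
moderate share of high scores among the other 975 swamps the 22 correct
identifications. This is one way to see why rare occurrences are so difficult
to predict.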

This example illustrates the difficulties involved in efforts at prediction.
Although it is not always necessary for predictions to achieve a high degree
of accuracy in order to have general utility, greater accuracy than has yet
been generally achieved is necessary for the development of significant
technologies for dealing with most human problems. The path to more accurate
predictions seems to lie in the further development of theories out of which
more accurate predictors may be put together. At least this seems to have
been the process in the physical sciences. Predictive indices which are not
solidly based on theory tend to be limited in scope. They may seem to be
accurate in reference to the specific groups on the basis of which they were
constructed, but this accuracy tends to decrease when these indices are
applied to slightly different groups or situations. A well-developed theory,
however, is sufficiently comprehensive and systematic to account for changes in
any factors that are relevant to the prediction at issue.

* Albert Morris, Absconders From Penal Institutions (Wellington, New Zealand:
Department of Justice, 1961).

† Sheldon Glueck and Eleanor Glueck, Predicting Delinquency and Crime (Cambridge,
Mass.: Harvard University Press, 1959).

This relationship between theory and prediction suggests a distinction
between prediction and prophecy. Scientific predictions are based on
propositions of the form, “If A occurs, then X will occur.” While predictions are
conditional statements, prophecies are not; they are of the form, “X will
occur.” Prophecies thus are not scientific in the sense that they do not stem
from propositions or theory.

Sometimes it is a useful task for the social scientist to attempt to transform
prophecies about human behavior into predictions. For example, individuals
who are unusually perceptive about themselves or others may be able to make
accurate prophecies about certain events. However, if the factual basis for
such effective prophecies is considered, then the prophecies can be transformed
into predictions. Here, the propositions on which such predictions are
based would combine the factual basis for the prophecy, “A occurs,” with
the prophecy “X will occur.” When the two are combined, we have a prediction:
“If A occurs, X will occur.”

Explanation 

In lay language, the term explanation need have little to do with the
scientific process, as when one explains the reasons for his actions. The scientist,
however, does not consider this an explanation for the behavior
unless there is acceptable evidence for it. The individual's own explanation
is quite fallible. For example, he may be unaware of the reasons for his
actions, or he may desire to emphasize those reasons which are socially
acceptable and omit those which are not.

In its scientific usage, explanation is similar to prediction in that both
rest on scientific propositions and theories. To the extent that there is
evidence for propositions and theories, they can be and are used by scientists
for explanations as well as predictions.

Scientific explanations usually describe the relationship between one or
more causes and one or more effects. They are based on some degree
of evidence for this relationship. For example, an individual might explain his
vote for a given candidate on the basis of the candidate's qualifications for
office. Assuming that the scientist has facts to the contrary, this
explanation will not be acceptable as a scientific one. For example, the
scientist may, from interviewing the individual, have good [...] that








the propositions or theory on which a given prediction is based could be
utilized to develop an explanation.

The relationship between explanation and prediction can easily be
confused if we do not conceive of them as matters of degree. For example, it
might be maintained that we understand and can explain the effect of the
various forces that act on a falling leaf without being able to predict its
resting place. As a matter of fact, we are able to make a prediction about its
resting place, although this prediction will be a probability statement rather
than one which is completely specific as to exact place. In addition, although
our understanding and explanation of the forces operating on the leaf is
great, it is limited. If the theory on which such explanations are based
improves, we might in fact be able to make a far more accurate prediction about
where the leaf will come to rest.
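
What a prediction in the form of a probability statement looks like can be
sketched with a small simulation. The sketch below is illustrative only: the
random-gust model, the parameter values, and the function names
`simulate_landing` and `landing_probability` are assumptions introduced here,
not anything taken from the text or from physics proper.

```python
import random


def simulate_landing(steps, gust_sd, rng):
    """Horizontal displacement of one leaf after `steps` random gusts."""
    x = 0.0
    for _ in range(steps):
        x += rng.gauss(0.0, gust_sd)  # each gust nudges the leaf sideways
    return x


def landing_probability(radius, trials=10_000, steps=50, gust_sd=0.1, seed=0):
    """Estimate the chance that the leaf lands within `radius` of the origin."""
    rng = random.Random(seed)  # fixed seed so the sketch is repeatable
    hits = sum(
        1
        for _ in range(trials)
        if abs(simulate_landing(steps, gust_sd, rng)) <= radius
    )
    return hits / trials


narrow = landing_probability(radius=0.5)
wide = landing_probability(radius=2.0)
print(f"P(within 0.5) is about {narrow:.2f}; P(within 2.0) is about {wide:.2f}")
```

The output is not an exact resting place but a statement of how likely each
region is. In this toy model a better theory would correspond to a smaller
`gust_sd`, which narrows the region in which the leaf can confidently be
expected to land.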

Explanations tell us why things happen either by focusing on the
phenomena to provide greater detail or by placing the phenomena within a
larger framework. In order to explain why a light bulb emits light when a
switch is turned on, we may refer to a sequence of detailed events, such
as the completion of an electric circuit, the difference in potential between
the two wires connected to the socket, the flow of electrons through the
circuit, the resistance of the filament to the flow of electrons, and so on. We
may also place the occurrence within a larger framework by referring to
the expectation of the individual who turns on the switch that this will be
followed by the emission of light from the bulb, the desire of this individual to
read a textbook, his desire to complete an assignment for a course, his
interest in learning as well as in obtaining an adequate grade, and so on. Thus,
explanation may be thought of either with reference to the Events [...]
occurring between X, a switch being turned on, and Y, a bulb
emitting light, or with reference to the Events M, [...] leading
to X. (Further discussion of sequences of events within the causal process is
presented in Section 16.1.)

Explanations are selective in that only some factors are included:
those which are believed to be relevant. To the extent that an explanation
depends on well-developed scientific theories about the motion of electrons,
say, or the decisions of human beings, the factors selected as
relevant constitute deductions from propositions. Thus, for example, there is
a general proposition that a potential difference is necessary for
electrons to flow in an electrical circuit, or there is a proposition that
motives or goals are instrumental in producing action in human beings. These
explanations are generally considered to be more dependable than
explanations not supported by such theories, because the former are backed up by
masses of evidence relating to many different types of phenomena.

Like prediction, explanation can serve both practical and scientific
functions. Even if it is generally not possible for the social scientist to make




what he calls “act meaning” (the explanation made by the actor himself)
and “action meaning” (that made by the scientist).* A confusion between
these two has sometimes been made by sociologists who have proposed the
concept of verstehen (understanding). The process of verstehen involves
an attempt by the investigator to explain the acts of another by imagining
himself to be in that person’s situation and then introspecting as to the motives
involved. Thus, the investigator confuses his own act meanings in a
hypothetical situation with action meaning. The basis for such inferences is that
human beings share many experiences and have the capacity to [...].
However valuable such inferences may be as hypotheses, they may or may
not be correct in reference to the actual act meanings of the individual under
observation. Furthermore, however valuable they may be in helping
to infer action meaning, these inferences must be combined with additional
facts and theories before an adequate scientific explanation of the behavior
can be formulated.

Just as predictions may vary in their accuracy, so is it true that
explanations may vary in their completeness. The aforementioned voter's interest in
lower taxes may be only one among a number of factors affecting his vote.
This range of variation for both predictions and explanations is
understandable when we take into account the fact that they both rest on scientific
propositions and theories. Propositions and theories vary greatly in their scope;
thus they may specify very few or very many of the relevant factors involved
in a given situation.

Even if all of the relevant factors are included in the theory, their
interrelationships might not be described very precisely by the theory. For
example, the theory might include a proposition that there exists a relationship
between the goals of voters (including interest in lower taxes) and their
political preferences, but it might go no further than that. It might not specify
the type of relationship between these goals and political preferences. If this
were true, predictions based on this theory could not be highly accurate, nor
could explanations be complete.

Is it possible to have explanation without also being able to predict, or to
have predictions without also being able to provide explanations? No, if both
are understood to be matters of degree and to rest on propositions and
theories. Thus, a very incomplete explanation might be based on a proposition
of quite limited scope; a prediction based on the same proposition could be
developed, although it might be a highly inaccurate one. In the same way,





Context of Discovery 


Sec. 3.2]

the evidence indicates that the ideas are incorrect, thus the investigator is 
directed toward new efforts within the context of discovery. 

Unless the investigator recognizes this interdependence, he may attempt to 
evaluate strategies for the context of discovery on a basis other than the evi- 
dence for or against them presented in the context of justification. This is 
especially true in a situation where there is little evidence available as to the 
merits and demerits of every research technique for a given situation. This 
seems to be the situation in behavioral science; the tools are available for 
evaluating the various strategies of discovery, but thus far very little of such 
evaluation has taken place. This, of course, encourages the evaluation of 
strategies of discovery on other grounds — e.g., the prestige of their pro- 
ponents. (Additional discussion of various strategies of research will be pre- 
sented in Section 3.4.) 

One of the most important ideas used by scientists within the context of
discovery is that of cause and effect. Many scientists do not like to state their
propositions in these terms because science is not in a position either to prove
that such relationships exist or to prove that any given relationship is one of
cause and effect. But these scientists generally continue to think in terms of
cause and effect. Is it wise for them to give up this idea because of its
uncertain status? Max Planck answers this question as follows:

The law of causality is neither true nor false. It is, rather, a signpost—
and, in my opinion, our most valuable signpost—to help us find our bearings
in a bewildering maze of occurrences, and to show us the direction in which
scientific research must advance in order to achieve fertile results. The law
of causality, which immediately impresses the awakening soul of the child
and plants the untiring question “Why?” into his mouth, remains a lifelong
companion of the scientist and confronts him incessantly with new problems.*


Logic-in-Use and Reconstructed Logic

A distinction which is important for an understanding of the context of
discovery is that between “logic-in-use” and “reconstructed logic.”† The
methods actually used by the scientist in investigating phenomena are
illustrations of logic-in-use; the rules, procedures, or methods of science which
are systematized and codified are examples of reconstructed logic. The
distinction is between what the scientist actually does, which has a logic of its own,
and idealizations of scientific practice.

This book, for example, constitutes a reconstructed logic, whereas the
investigator's actual behavior in performing research constitutes a logic-in-use.
The two may differ to a greater or lesser extent. The researcher’s actual behavior very

* Max Planck, “The Concept of Causality in Physics,” in Philip P. Wiener (ed.),
Readings in the Philosophy of Science (New York: Charles Scribner's Sons, 1953), p. [...].

† Kaplan, op. cit., pp. 3-11.




very accurate predictions about human behavior, he can develop
explanations of varying scope and validity which can be used to develop
techniques for dealing with many problems and to provide direction for
scientific investigation. The scientist should therefore be wary of explanations
which, instead, tend to close the door on further research. (For example, the
postulation of “instinct” as an explanation of many different types of
behavior tended to discourage the search for non-biological factors.)

Explanation is also pursued by the scientific community for its own sake,
and this seems to be one of the most important factors motivating scientists
generally. Numerous scientific investigations take place which seem to offer
no immediate or long-range benefits to society other than an increase in
knowledge. Many laymen find such a motive hard to understand, especially in
a society such as that of the United States with its emphasis on practical
applications. Of course, the history of science is full of examples of research
which appeared that way originally, but which ultimately had enormous
technological repercussions.

Functionalism represents an approach to providing explanations of social
systems and human behavior which, together with the technique of verstehen,
aids in the development of hypotheses. A functional explanation calls
attention to the needs or requirements of a given system and consequently directs
the investigator toward observing phenomena he might ordinarily overlook.
As in the case of the verstehen technique, however, evidence is needed before
much credence can be given to any functional explanation; categorical
statements about the “functional prerequisites” or the “functional necessities” of
a given system do not have any special immunity from the testing process.


3.2 CONTEXT OF DISCOVERY 
Discovery and Justification

One distinction that will aid us in viewing science as a process is that
between the context of discovery and the context of justification. When we view
inquiry within the context of discovery we are concerned with the strategy and
tactics leading to the development of ideas that may result in scientific
progress. Ideas may or may not bear fruit, and it is within the context of
justification that evidence is presented and decisions about the worth of ideas are
made.

The two contexts are closely interrelated. The ultimate test of strategies
for the context of discovery, as well as for ideas developed within that context,
is provided in the context of justification. The results of tests within the
context of justification suggest the directions for future inquiry. Evidence may be
provided which indicates not only that the ideas are worthwhile and should
be followed up but also what kinds of new data are most urgently needed. If





Like any device used in the context of discovery, models have their
possibilities and their dangers. It is believed that the human mind learns best about
the unknown by proceeding from the known, and the utility of scientific
analogies or models lies in the fact that they dramatize the implications of
the known. The use of models is standard practice in pedagogy; addition
and subtraction, for example, are often taught with the aid of concrete entities,
and the student is often taught the most abstract formulations with the aid of
less abstract ideas which he has previously learned and which provide him
an intuitive understanding of the more difficult material. It is always possible
to develop a model of a given phenomenon, for there is always some similarity
between the system which constitutes the model and the system which
constitutes the reality. The question the researcher must answer, however, is
whether or not there is something further to be gained by setting up a model
that goes beyond the apparent similarities with reality. Might such a model
serve to suggest new ideas as to the nature or function of reality, ideas that
might not otherwise be apparent?

One of the dangers of dealing with models is that they may be taken
literally and come to be treated not as an hypothesis but as the reality itself.
Reification of models leads to several different types of errors. If the model is
accepted as the reality rather than as a tentative statement about reality, the
investigator may be led to close the door on further research. Or he may
incorporate in the model factors that are irrelevant to the phenomena under
investigation or exclude factors that are important. It should be
remembered that, although models can suggest potentially fruitful hypotheses, they
can also suggest erroneous hypotheses and serve to prevent the development
of certain relevant hypotheses.

The term “model” is often used in a more specialized sense than the one
just elaborated. It has frequent usage as a synonym for theory, with the
connotation of highly systematic or mathematical theory. Social scientists
also employ the term “mathematical model,” which designates a theory stated
in mathematical terms.

Although models have important applications for pedagogy (e.g., in training
with the Link trainer, or in the study of economics with hydraulic models of the
national economy), it is their potential research function which is of interest
here. Operating models generate their own data, as distinct from the models
of abstraction presented in Figures 2-1 and 2-2. (The latter illustrate
some aspects of the abstracting process, but they are not operating models
in the sense of continually generating their own data.)

An important principle in developing models is the clear distinction
between the model and the reality being modeled. In the game
called “Careers,” for example, the game (the model) may be distinguished from the
process of occupational choice (the reality). It would be instructive to
compare a theory of occupational choice developed within the context of the




often departs from his ideas on the way research should be conducted, and
he may be aware of such departures only to a limited extent. His
reconstructed logic might call for trying out a wide range of methods, but his
present skills and the exigencies of time may prevent him from doing so.

It may occur that a given logic-in-use is superior to the most prestigeful
reconstructed logic available. In such situations it is necessary for
such a logic-in-use to be codified and expressed as a reconstructed logic
if it is to gain widespread acceptance. Such acceptance may be difficult to
obtain in spite of the availability of evidence in its favor, for the
prestigeful reconstructed logic may be treated as [...] rather than as
in need of evaluation within the context of justification. However, the history
of science seems to show that in time the evidence for a given reconstructed
logic makes its weight felt.

It is when the proponent of a given reconstructed logic acts as an
“umpire”* in the game of scientific discovery and rules out all reconstructed logic
which does not conform to his own that potentially fruitful investigation
may be blocked. This approach on the part of leading scientists may lead
the practicing scientist to distrust his own ideas about research and
thus fail to develop and test them. Instead of feeling a sense of
autonomy for constructing the most appropriate research techniques for a
given situation, he may feel a sense of impotence in the face of [...].

In psychology, for example, certain kinds of highly controlled experiments
are revered while less controlled “field experiments” are often disparaged. In
sociology, the large probability-sample survey is often valued for its own
sake, whereas the small non-probability-sample survey is lightly regarded. In
anthropology, very detailed interviews repeated over long periods of time
are regarded by many as the sine qua non of research methods.


Models and Operating Models

Simulation, one of the procedures which behavioral scientists have found
useful for the development of ideas within the context of discovery, is based
on the idea of a “model.”

Models may consist of words, mathematical symbols, pictures, or physical
objects. They are used by the scientist as “analogues” of some aspect of
phenomena in order to suggest ideas or draw comparisons on the nature of
the phenomena. They often deal with comprehensive systems and, as such,
encourage the development of ideas for systematizing the relationships among
different aspects of phenomena. Simulations are operating models of
phenomena and, as such, they generate data.

* Kaplan, op. cit., p. 25. He refers to the role of the methodologist as a “coach,” whose
advice is simply the voice of experience, rather than as an umpire, whose rules must be
followed.



The Human Equation 


Sec. 3.3]

formulation utilized in the medical student study)? Can the EVD formu- 
lation be improved so as to yield highly accurate predictions? 

5. Even in relatively simple situations where expectations as to the oppor- 
tunities for fulfilling goals are relatively clear (as in the game where 
these descriptions are recorded in the squares within each occupational 
path), to what extent are these expectations received and interpreted 
uniformly, and how does this affect choices? 

6. To what extent do motives or transitory values developed in specific 
situations (such as the desire to “bump” a player simply because he is 
behaving obnoxiously in the bargaining situation) affect choices, and 
can such a process be incorporated within the theory of choice as a 
function of values and expectations? 

7. To what extent do the objective alternatives open to the individual
(the kinds of “opportunity cards” and “experience cards” he has), as
well as his own awareness of them and their implications, affect choices?

It should be noted that operating models such as “Careers” may be used
to transform into advantages many of the difficulties often involved in the use
of models. The questions formulated do not reify the model; rather, they
focus on the reality itself, and the model helps to direct attention to this reality.
Furthermore, this particular model seems to have opened doors to further
research. Many of the questions were formulated as a result of playing the
game, and such experience with the game is suggestive of similar games which
may be used to test the effectiveness of predictions based on a theory of choice
behavior. Furthermore, it was the lack of correspondence or isomorphism
between certain aspects of the game and aspects of reality which served to
focus attention on possible research questions.

For example, it became apparent that goals other than those of happiness,
fame, and money points were involved in players' behavior. Some of these
were rather subtle (e.g., the desire to “bump” another player regardless of
whether it helps one to win the game). Players seemed to have little awareness
of such additional goals. All of this calls attention to the wide range of goals
which seem to affect human behavior, and it provides clues as to techniques
for measuring subtle values.


3.3 THE HUMAN EQUATION

There is general agreement within the various behavioral sciences that it
is possible to investigate and analyze human behavior scientifically, but such
agreement does not yet exist outside the field. There are many misconceptions
about the nature and possibilities of behavioral science, and they are
deeply rooted. Such misconceptions are to be found even among students of
behavioral science. The best way for behavioral science to [...]



[Ch. 3

THE PROCESS OF INQUIRY 

game with theories formulated in the context of more realistic choice
situations.

At the beginning of the game of “Careers,” each player decides on the
relative importance of “happiness,” “fame,” and “money” for himself, giving
each any number he desires for a total of 60 points. These numbers are kept
hidden from other players, and they may not be changed during the course
of the game. The object of the game is to collect the amounts originally
specified of “happiness,” “fame,” and “money.” These may be obtained by
going through any of the eight career paths provided: farming, college, big
business, seafaring, politics, Hollywood, uranium prospecting in Peru, and
expeditions to the moon. Each path consists of a number of squares, most of
which are rewards or penalties of one type or another (e.g., in the Hollywood
career: “win Oscar, score 12 fame points”; “[...] drops off slightly, cut
salary by half”). A number of choices confront each player during the course
of the game, most of which fall into one of three categories. One choice
involves whether or not to enter a given career, and the entrance fees as well
as the rewards and penalties that occur throughout the career path are
generally among the factors taken into consideration. Another choice involves
whether to move around the board by throwing the dice, by using an
“opportunity card” (which permits one to move to the “entrance square” of a
given occupation), or by using an “experience card” (which permits a player
to move the specific number of squares printed on the card). A third type of
decision occurs in the bargaining situation, which results when one player
moves to a square occupied by another player. Unless the players reach a
mutually satisfactory agreement, usually one in which the first player on the
square pays a sum of money to the second player, the second player can
“bump” the first to a penalty square (the park bench, which represents
unemployment and a resultant loss of time or money).
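
The scoring rule of the game can be sketched in a few lines. This is an
illustrative sketch only: the class name SuccessFormula, the method is_met, the
requirement that each number be non-negative, and the sample figures are
assumptions introduced here. Only the three goals, the total of 60 points, and
the object of collecting the amounts originally specified come from the
description of the game.

```python
from dataclasses import dataclass

GOALS = ("happiness", "fame", "money")
TOTAL_POINTS = 60  # total stated in the description of the game


@dataclass(frozen=True)
class SuccessFormula:
    """A player's hidden allocation of points among the three goals."""
    happiness: int
    fame: int
    money: int

    def __post_init__(self):
        weights = (self.happiness, self.fame, self.money)
        # Assumption: negative allocations are not allowed.
        if any(w < 0 for w in weights):
            raise ValueError("each allocation must be non-negative")
        if sum(weights) != TOTAL_POINTS:
            raise ValueError(f"allocations must total {TOTAL_POINTS}")

    def is_met(self, collected):
        """True when the points collected reach the target on every goal."""
        return all(collected.get(goal, 0) >= getattr(self, goal)
                   for goal in GOALS)


# Hypothetical player who weights money most heavily.
formula = SuccessFormula(happiness=20, fame=10, money=30)
won = formula.is_met({"happiness": 20, "fame": 12, "money": 30})
still_short = formula.is_met({"happiness": 20, "fame": 9, "money": 40})
```

Because the allocation is fixed at the outset, a surplus of one kind of point
(here, extra money) does not make up for a shortfall in another.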
As a model of the choice process, the game of “Careers” suggests the
following questions which seem to be in need of exploration:


1. In dealing with occupational choices, as well as with choice in general,
what would constitute an adequate list of values? How is one to deal
with the conceptual haziness of the terms representing the various values
when they seem to mean different things to different players?

2. How fixed are these values in actual life? What system of propositions
might be used to predict both their initial development and their
subsequent transformation?

3. To what extent are individuals aware of their own values, and those of
others? How does this degree of awareness affect their choice behavior?

4. How well can choices be predicted by a summation of the gaps between
values and expectations as to opportunities for achieving them (i.e.,







The Human Equation 


ARGUMENT 3: 

Human beings cannot possibly study other human beings without bias. The
student of human behavior cannot but have a vital interest in the kinds of
conclusions he reaches, and this prevents him from seeing and analyzing
phenomena objectively. Furthermore, his biases also operate in the selection
of problems for study.


ANSWER:

It is true that behavioral scientists are human beings and that their behavior
is therefore guided by their own values, but this is true of all scientists. Among
the values of the social scientist, however, is that of scientific objectivity, and
he generally makes every effort to limit the effect of his own values on his
presentation of the data and his assessment of the evidence. A still more
effective check on bias in the behavioral sciences is the fact that studies are
replicated and data are interpreted by different social scientists who have
different values and who provide checks and balances against bias. As for bias
in the selection of a problem (as distinct from bias in its investigation), both
behavioral scientists and physical scientists are in this regard influenced by
their own values. But in this case, too, the values of both may lead
to the selection of those problems which offer possibilities for significant
advances in scientific knowledge. Even when problems are selected on the
basis of other values, the collection and analysis of the data may be
highly objective.


ARGUMENT 4:

Human beings, unlike inanimate objects, are guided by values and expec-
tations. These are subjective factors and consequently cannot be studied
objectively.


ANSWER:

Although human behavior is guided by subjective values and expectations,
it does not follow that these entities cannot be studied objectively. Various
devices (such as questionnaires and interviews, projective tests, observa-
tional schedules, and experimental techniques) provide means of
enabling the social scientist to distinguish with some success between act
meanings and action meanings.

ARGUMENT 5:

Predictions about human behavior may not hold because the individuals
involved may become aware of these predictions and alter their behavior
accordingly.




misconceptions is to demonstrate its ability to explain and predict
effectively. The following are some of the arguments used most frequently
against the idea that much can be done in behavioral science.

ARGUMENT 1:

Human beings are not governed by scientific laws because they have free
will. They are not physical objects or automata but are free to choose among
various courses of action.


ANSWER:

The automobile driver may subjectively feel that he is free to run into
other cars, but we can predict that he will not ordinarily do so. We
can also formulate laws based on such regular patterns of behavior and
generalize that “normative pressures will be effective in producing behavioral
conformity depending on the severity of the sanctions.” This is not to deny
that individuals who are conforming subjectively feel that they have a choice
in the matter. Even the freest of choices seem to have causes. This is not
to say that they are “governed” by outside forces which the individual is
powerless to control. Theories of choice emphasize internal forces such as
values and expectations, leaving room for external forces as well.


ARGUMENT 2:

Human behavior is too complex and changeable for a genuine science
to emerge. Every individual is unique; people behave differently in different
historical periods; and anthropologists have demonstrated the wide variety
of behavior among cultures. The most that we can hope to do is
understand individuals in a given society during a limited period of time.


ANSWER:

It has often occurred within the history of science that, prior to the formula-
tion of scientific explanations, phenomena have appeared to be highly
complicated. For example, at one time it must have seemed preposterous that
one law might account for the velocity of any freely falling object, including
apples of all shapes, sizes, and colors; pears; people; chairs; and so on. The
fact that there are some differences in behavior among different societies and
historical epochs does not deny the possibility that there are some (or many)
similarities. As a matter of fact, the supposed differences may not refer to laws
of human behavior but, rather, to constants or parameters similar to g in
Galileo’s law for freely falling bodies: $s = \frac{1}{2} g t^{2}$, in which g (the gravitational
constant) takes on different values at different latitudes, although the same
law holds at all latitudes.
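
The point about parameters can be illustrated numerically. The sketch below
applies the same law, s = (1/2)gt², with two values of g. The figures used
(about 9.780 m/s² at the equator and about 9.832 m/s² at the poles) are
approximate sea-level values supplied here, not taken from the text, and the
function name distance_fallen is hypothetical.

```python
def distance_fallen(g, t):
    """Distance in metres fallen from rest after t seconds: s = (1/2) * g * t**2."""
    return 0.5 * g * t ** 2


# Approximate sea-level values of g in m/s^2 (assumed for illustration).
G_EQUATOR = 9.780
G_POLES = 9.832

T = 2.0  # seconds of free fall

s_equator = distance_fallen(G_EQUATOR, T)  # about 19.56 m
s_poles = distance_fallen(G_POLES, T)      # about 19.66 m
```

The form of the law is identical in both cases; only the parameter differs,
which is the sense in which apparent differences among societies might reflect
different parameter values rather than different laws.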




Research Strategy 


Sec. 3.4]

(professional ethics have been developed in the different social sciences which
serve to reduce this possibility), much good may also come of it.

ARGUMENT 8: 

It is not possible to perform controlled experiments on humans. 

ANSWER:

Many controlled experiments have already been performed on humans
(see Chapter 5). Such an argument usually is based on a lack of understand-
ing of the nature of experiments. They need not be harmful to those who are
the subjects of the experiment. Also, it is generally not difficult to set up the
kind of controlled condition needed for many kinds of controlled experiments
with human subjects.

ARGUMENT 9: 

The astronomer can predict the position of a given planet years from now,
but how can anyone hope to predict where a given person will be in several
years? So many accidents occur which completely change the direction of
human lives. Now if I hadn’t just happened to fall down and land in the
hospital I would have been on that plane that was lost over the Atlantic.

ANSWER:

Long-range prediction is not the distinguishing characteristic of a science;
it can be achieved only in the few fields where the phenomena are isolated
from outside influence and closure has been achieved. The physicist cannot
accurately predict where a falling leaf will be blown by the wind, but because
we have evidence for laws such as the relationship between time of free fall
and distance fallen in a vacuum, we evaluate physics as an effective science.
In the same way behavioral science has uncovered many relationships among
phenomena which can be used to explain at least partially the behavior of a
given individual and actually to predict his behavior, although with a sub-
stantially lower probability of success than that achieved in many (but not
all) branches of physics. The time may come when many now supposedly
“accidental” phenomena will be explained and predicted.


3.4 RESEARCH STRATEGY

Values and the Scientist

Behavioral scientists differ on what they believe constitute the most
effective approaches to research. Choices have to be made because resources
for research are limited, and the problem for the scientist is to make
effective choices. Unfortunately, research on the research process itself has




The interference of the observer with the phenomena observed is
common to all the sciences. Current techniques of data collection attempt to
minimize this influence. More fundamentally, the law
governing the velocity of a freely falling body is not invalidated if the predic-
tions based on it do not hold true as a result of someone's altering the velocity
by shooting at the object. What is required, and what is certainly possible, is
that additional laws be developed about the ways in which individuals alter
their behavior when they have knowledge of the predictions of the
scientist.

ARGUMENT 6:

Behavioral science has yielded very little knowledge about human behavior
and seems to consist mainly of trivial statements which are obvious
to most observant individuals. It is not a science but, rather, an exercise
in proving the obvious.


ANSWER:

What may be “obvious” to one individual may not be to another. An inven-
tory of many of the findings of social science* will reveal to the reader that
a great many of the findings were previously not “obvious” to him. As for
triviality, social scientists generally grant the insufficiencies in the proposi-
tions, theories, explanations, and predictions, but we might just as well ridi-
cule the field of meteorology because of the inaccuracy of weather forecasts.
A more responsible role for the critic is that of working toward the creation of
conditions that will encourage the field to develop more rapidly.


ARGUMENT 7:

A science of human behavior will never develop because it is immoral and
society will resist it. Human beings are not ciphers or equations and it is
inhuman to treat them in this manner. Progress in social science will mean
that human beings can be manipulated, and this will set the stage for the
most complete totalitarianism man has ever known. Furthermore, it would
be immoral to experiment on human beings.


ANSWER:

Knowledge is power, and power may be used for good as well as for evil.
Manipulation may be one result, but a fuller development of man's creative
capacities might well be another. Social science offers the hope of freeing man
from ignorance. In questions of morality, more than one value is usually
involved, and if some harm can come from experimentation on human beings
* Berelson and Steiner, op. cit.


Mills’ argument is that social scientists often use scientific objectivity as an
excuse for an uncritical acceptance of the framework of society and that, as 
a result, they spend their intellectual force on “the details of small-scale 
milieu” instead of concentrating on the large problems of society. 

A similar point of view is taken by Alvin Gouldner: 

There is one way in which those who desert the world and those who sell 
out to it have something in common: neither group can adopt an openly 
critical stance toward society. Those who sell out are accomplices; they may
feel no critical impulses. Those who run out, while they do feel such im- 
pulses, are either lacking in any talent for aggression, or have often turned 
it inward into noisy but essentially safe university politics or into professional 
polemics. In adopting a conception of themselves as “value-free” scientists, 
their critical impulses may no longer find a target in society.” 

In a similar vein, Maurice Stein argues that some of the basic concepts of 
the sociologist (norm, role, and social system) imply a conservative political 
position; he charges that these concepts “can no more arrest the processes of 
rapid change than can conservative political ideologies.” 

One critical observer characterizes the ideas of those following Mills' lead:


What seems new, however, is the immediacy of the summons to each indi-
vidual insofar as he is a unique existent to: (a) reject all formal organiza-
tion and roles; and (b) insist upon total authenticity and undividedness for
himself in “existence” and “being.”

In this setting a new amalgam has emerged; we may characterize the new
amalgam as an existentialist neo-Marxism aimed against “alienation” and
“self-estrangement” in all their forms, especially as these are incurred by
involvements in the so-called organized society, mass culture, nonutopian
social participation.


It seems that the debate has pointed up a central question: Which strategies
for the selection of problems can maximize the advancement of scientific
knowledge as well as the solution of important human problems? Parsons’
position stresses the importance of allowing latitude for the investigator to
define problems in terms of their “scientific” significance, but such a formula-
tion does not provide guidelines on, for example, the question of “ethical
neutrality.” Mills and Gouldner maintain that ethical neutrality in the selection

Alvin W. Gouldner, “Anti-Minotaur: The Myth of a Value-Free Sociology,” in
Maurice Stein and Arthur Vidich (eds.), Sociology on Trial (Englewood Cliffs, N.J.:
Prentice-Hall, 1963), p. 43.

Maurice R. Stein, “The Poetic Metaphors of Sociology,” in Stein and Vidich, op. cit.

Benjamin Nelson, “Sociology and Psychoanalysis on Trial: An Epilogue,” Psycho-
analysis and the Psychoanalytic Review, XXXXIX: MS (1962).


been limited, and information about which strategies are effective is
primarily of a qualitative nature. Because the degree of success
achieved by the social sciences is limited, objective evaluations of strategies
are sparse.

Although behavioral scientists generally agree that scientific investigation
of human behavior is both possible and desirable, one question which
seems to be the subject of much controversy concerns the selection of problems
for investigation: whether or not problems which are important
in terms of their implications for human welfare are also important problems
for scientific study.

In his presidential address to the American Sociological Association in
1949, Talcott Parsons advanced the following point of view:


It is not a question of whether we try to live up to our social responsi-
bilities, but of how. If we should put the overwhelming bulk of our resources,
especially of trained talent, into immediately practical problems it would do
some good, but I have no doubt that it would have to be at the expense of
our greater usefulness to society in the future. For it is only by systematic
work on problems where the probable scientific significance has priority over
any immediate possibility of application that the greatest and most rapid
scientific advance can be made.*


Behavioral scientists have had much experience with the shortsightedness of
many individuals who control research funds. The conflict between
“basic” and “applied” research is not limited to the behavioral sciences, as any
chemist who has been prevented from following a scientific lead
because of its limited remunerative potentiality will readily attest. Parsons’
position states the importance of “scientific” definitions of problems over
definitions which focus on immediate practical problems.

C. Wright Mills takes the following stance in viewing the selection of
problems:


. . . whether he is aware of it or not, anyone who spends his life studying
society and publishing the results is acting morally and usually politically
as well. The question is whether he faces this condition and makes up his
own mind, or whether he conceals it from himself and from others and drifts
morally. Many, I should say most, social scientists in America today are
easily or uneasily liberal. They conform to the prevailing fear of any passion-
ate commitment. This, and not “scientific objectivity,” is what is really
wanted by such men when they complain about “making value judgments.”


* Talcott Parsons, “The Prospects of Sociological Theory,” American Sociological
Review, XV: 15 (1950).

C. Wright Mills, The Sociological Imagination (New York: Grove Press, 1961).



variables by means of quantitative analytic techniques.
There is a vast gap between those who favor qualitative
descriptions of a given culture and those who favor
structured interviewing techniques and highly structured survey techniques.

The behavioral scientist who is qualitatively oriented may argue
that the realism of the natural social world is lost when phenomena are forced
into a quantitative mold, and that techniques of analysis which focus on a
limited set of “variables” play havoc with the context of the situation
under study. The quantitative researcher may respond by maintaining
that it is the controlled situation, such as the experiment, which
provides the best opportunity for relating such phenomena to the
vital factors and processes which are often obscured in natural situations. He
might also refer to the improved techniques now available, such as multi-
variate statistical procedures and electronic data-processing techniques, which
make it possible for the scientist to take into account a large number of
variables at one time and thus deal with the context of a situation in an
objective manner.

Strategy and the Context of Justification

The advancing of those ideas which have proved themselves within the
context of justification has widespread support among practicing scientists.
The scientific enterprise is one in which ideas are subjected to objective
test, and those for which there is evidence receive further attention. This
pragmatic strategy is built into the scientific enterprise and is part and
parcel of it.

This pragmatic strategy comes into conflict with some of the points of view
stated earlier. For example, in the discussion of current strategies in the social
sciences, implications were drawn as to the lack of evidence for certain nega-
tive evaluations of some strategies: quantitative studies cannot succeed
in adequately dealing with the context of complex situations; qualitative
studies cannot provide objective evidence for purposes of verification; the
work of the experimentalist and the quantitative survey-researcher
lacks scope; strategies that encourage comprehensive theorizing and predic-
tion are too ambitious for the present. There also seems to be a lack of evi-
dence to support the positive evaluations of certain strategies, e.g.,
quantitative and rigorous social science constitutes the most certain road to
the advancement of knowledge; a focus on important political problems
[...]; “middle-range” theory offers the greatest [...];
certain reconstructed logics [...].

Most of the evidence for evaluating strategies is
qualitative and derives from the history of science. One recurrent
theme seems to be that the scientist would do well to be wary of evaluations
of strategies that are based on insufficient evidence, such as those just cited.



of problems is a fiction and that social scientists tend to take the safe
road of avoiding a certain class of problems, the conclusions
for which may imply fundamental criticism of existing institutions. Stein goes
somewhat further and argues that this conservative bias in the selection of
problems has also permeated the major concepts used by most sociologists.

The debate seems to focus on the merits of different recon-
structed logics for the context of discovery: the selection
of problems which are “scientifically” important, and the selection of
problems which are politically and historically
“important.” But like any reconstructed logic, these are only tentative
hypotheses about the fruitfulness of certain strategies for the selection of
problems. The ultimate test of any logic-in-use is not conformity with some
previously successful reconstructed logic but the production of accu-
rate predictions and substantiated explanations having considerable scope.

The scientist is not necessarily faced with this dilemma; his research
situation is often quite salutary. The medical student study, for example, had
to contend neither with an “establishment” [...] definition
of the problem nor a shortsighted funding institution pressing for an applied
and scientifically uninteresting definition of the problem.


The Quantitative-Qualitative Axis

Some social scientists prefer highly quantitative and precise techniques,
and there are those who prefer to stay as close as possible to natural situa-
tions and conduct research along qualitative lines. There are preferences for
quantitative or qualitative studies, laboratory or field experiments, unstruc-
tured, semistructured, or structured interviews and observational procedures,
and so on. These differences occur within each of the social sciences, but the
disciplines also tend to be located on the quantitative-qualitative axis. Psy-
chology, for example, is generally closer to the quantitative pole than sociol-
ogy, but sociology is generally closer to it than anthropology. In psychology,
the experiment is quite popular. In sociology, the most frequent mode of
research is the survey, and a variety of quantitative design and analysis
techniques have been developed. Anthropologists also rely heavily on
interviews, but their techniques tend to be less structured and they often
devote a great deal of time and attention to securing the kind of details which
help to provide the context of the phenomena under investigation.

Some of the differences within a given field are as great as the differences
between fields. The clinical psychologist learns to have an appreciation for
objective measurement, but his orientation tends to be far more qualitative
and contextual than that of the experimental psychologist. The qualitative
sociologist may adopt a method of participant observation with the aim of
understanding a given system as a functioning unit; the survey research
analyst, on the other hand, operates with a set of dependent and independent



Data Collection 


For the scientist, a knowledge of the nature of theory and the dynamics of
research is insufficient for achieving the goals of science, as is the understand-
ing of a wide variety of specialized techniques for the collection, measure-
ment, and analysis of data. Substantial results may be expected only when
techniques are understood in the framework of the nature of theory and
the dynamics of research and when both are put to work in the development
and testing of theory.

In Chapter 4 general principles of data collection are discussed; the detailed
discussions of the methods of data collection are left for Chapters 5 through 9.
The definition of the problem provides a means for the investigator to set up
priorities so that the resulting data will be valuable for the advancement
of science. The research design, which lays down a more specific blueprint,
is constructed on the basis of this definition. A central problem with



Numerous factors militate against objective evaluation of strategies, such as
attempts to substitute polemics for evidence, both among and within
disciplines. But whatever evidence there is on the effectiveness of the different
research strategies and tactics should be utilized, for resources are
not unlimited and choices must be made.

The scientific advances seem to have resulted from a concern with test-
ability and sophisticated methodological procedures on the one hand, and
from a concern with theories of varying degrees of scope on the other. If the
evaluation often made as to the sparsity of systematic theory in the behavioral
sciences is correct, then further efforts at constructing systematic theory may
well prove worthwhile, and all the methodological procedures currently
available will be required to collect and analyze data bearing on
such systematic formulations. The research process is not an impersonal and
mindless mechanism; it is a human enterprise, and systematic theory will not
emerge unless it is consciously developed. Like all research strategies, of
course, this too should be evaluated in terms of how fruitful it proves
for the advancement of behavioral science.

SELECTED REFERENCES FOR FURTHER STUDY 

ROLLO HANDY and PAUL KURTZ, A Current Appraisal of the Behavioral Sciences
(Great Barrington, Mass.: Behavioral Research Council, 1964). As the title
specifies, this volume views and evaluates trends in the behavioral sciences with
an eye toward some of the most recent developments. It includes a description
of some of the newer fields, such as information theory, cybernetics, linguistics,
game theory, decision-making theory, and general systems theory.

C. WRIGHT MILLS, The Sociological Imagination (New York: Grove Press, 1961).
In this, one of his last books, Mills presents both a scathing indictment of much
that exists in contemporary sociology and the promise which sociology holds
for mankind. Although the tone is polemical with respect to much that goes under
the name of theory and method in sociology, he also emphasizes the importance
of theory and method for every sociologist.

C. P. SNOW, The Two Cultures: And A Second Look (New York: [...],
1964). Snow’s “two cultures” separate the scientists from the literary intellec-
tuals, and he maintains that the wide gulf which divides them is not only detri-
mental but dangerous for civilization. He argues that the gulf must be bridged
so that we may cope with and harness the scientific revolution.


which the scientist has to deal is the relationship between the observer and
the observed. Chapters 5 through 9 take up [...],
interviews (and questionnaires), [...]. In the discussion of each,
attention is given both to the relationships of the observer and the observed
and to the technical developments of these methods. It is the combination of
a thorough understanding of the data-collection process and a mastery of a
range of possible techniques which best equips the investigator for his task.


Sec. 4.1] Definition of the Problem

conclusively establishing the theory, but rather as pointing up the possibility
that this or a similar theory might be valid.

Some of the possible directions for future research were discussed in Sec-
tion 1.5. These included a broadening of the formulation to include expected
value fulfillment, an effort to improve data collection techniques, and an appli-
cation of the theory to other areas of human choice. The first two of these
approaches might lead to more accurate predictions, whereas the third is
designed to extend the scope of the theory. This illustrates the general effort
on the part of social scientists to define their problems so as to improve theory
according to the criteria used for evaluating theory. In addition to assessing its
scope and ability to predict, testing of the formulation will hopefully provide
more evidence for it.

Selection of Concepts 

A wise choice of a scientific problem differs from an unwise choice in that
it advances the goals of science; an unwise choice does not. But how can one
determine which choice will be fruitful before he expends a great deal of
effort on a particular problem? Some general criteria can be used for guid-
ance, but they must be used warily. One criterion is the degree to which the
concepts involved have clarity, scope, and systematic import. But a
clear and precise concept may be trivial; and the concept “human being” has
great scope, though it has thus far done little for the progress of social science.

A more trustworthy criterion is that of systematic import. The problems
involved in the process of assessing systematic import are great. How is the
researcher to determine the degree to which a given concept, such as norm,
has systematic import? Is he to try to locate all the social science studies that
utilize the concept? Even if he does so, how is he to evaluate the results of any
given study? He may try to differentiate among the studies in terms of the
degree of evidence for the propositions, their scope, and the degree to which
the results are systematically interrelated, but each of these approaches poses
further problems.

Little research has been done on the solution to these problems so as to
direct the researcher in the construction of a quantitative measurement of
systematic import. Nevertheless a search of the literature relating to a
problem has certain important functions in addition to that of providing a
basis for a qualitative assessment of systematic import. First, it is one of the
useful mechanisms for reformulating initial ideas as well as for forming new
ones. This is an example of the aid provided by the community of scien-
tists to the individual investigator within the context of discovery. The extent
of this aid is generally unknown, for the scientist absorbs many ideas from
various sources throughout his educational process; the determination of
the complex interrelationships that helped him in the formulation of a given
idea would constitute a major research project in itself.



PRINCIPLES OF DATA COLLECTION [Ch. 4

In addition, the search for systematic import on the part of a given scientist
subsequently feeds back information to the scientific community. The research
report presents not only the findings but the relevant literature as well. Thus,
the search for systematic import provides the basis for communication among
members of the scientific community. If it is understood with reference to
these functions, the problem of the precise calculation of the systematic import
of a given concept becomes less important. The investigator must balance
the resources required for an extensive literature review against the resources
necessary for a preliminary test of hypotheses that incorporate the concept.

Certain materials can help the researcher to find those books, journals, and
articles that are particularly pertinent for him. For example, Psychological
Abstracts and Sociological Abstracts present brief summaries of articles from
a wide variety of journals. Another example is a recently compiled inventory
of research on human behavior, Human Behavior: An Inventory of Scientific
Findings,¹ which summarizes the results of 1045 studies and provides com-
plete references for all the studies as well as for important review articles.
A different kind of summary statement is provided by A Current Appraisal of
the Behavioral Sciences,² which is organized by disciplines: anthropology,
sociology, history, economics, political science, jurisprudence, psychology,
education, information theory, cybernetics, linguistics, sign behavior, game
theory, decision-making theory, value inquiry, and general systems theory.
All of the above sources are generally best used as an aid to locating the
original materials rather than as a substitute for them. They constitute, of
course, only a few examples of reference materials.

Many social scientists find patterns of informal communication to be
highly productive. Some develop a list of colleagues who share a similar focus
and use communication within this group to keep up with current work as
well as to test new ideas. Others depend more on professional meetings or on
relationships with colleagues in their immediate locale.

Selection of Propositions and Theories

The definition of a problem involves the evaluation not just of the concepts
utilized but of the propositions and theories as well. The criteria for these
purposes (see Chapter 2) are testability, scope, and the effectiveness of
explanations and predictions. A search of the relevant literature with reference
to propositions and theories will aid the process of scientific communication.
In the case of concepts, clarity and scope were viewed as secondary criteria;
in this case the corresponding ideas of testability and scope are viewed as
subsidiary to explanation and prediction.

¹ Bernard Berelson and Gary A. Steiner, Human Behavior: An Inventory of Scientific
Findings (New York: Harcourt, Brace & World, 1964).

² Rollo Handy and Paul Kurtz, A Current Appraisal of the Behavioral Sciences (Great
Barrington, Mass.: Behavioral Research Council, 1964).


The criteria of effective explanation and prediction constitute a vast topic,
the discussion of which will comprise most of the remainder of this book.
For example, Parts Two and Three of this volume focus on the collection and
measurement of data, and fundamental propositions dealing with human
behavior are utilized to aid in obtaining an understanding of [...]. In Part
Four various statistical criteria are developed which may be used to evaluate
the evidence for any given proposition. Part Five has to do with the extent
to which propositions are systematically interrelated, and [...] as an
additional criterion for evaluation of propositions and theory.

Cumulation 

We often speak of scientific knowledge as "cumulating," as if there were a
universal law of nature that this inevitably occurs with the passage of time.
This manner of speaking implies a view of science as a vast, impersonal
mechanism that moves ahead regardless of what any given scientist does or
does not do.

Actually, the channels of formal communication [...] findings and ideas, and
additional channels are [...]. It is consequently difficult for the scientist
to make himself heard by those others who can contribute most to the
development of his ideas, and it is equally difficult for him to hear what is
being said. The existence of patterns of informal communication aids many
social scientists to overcome [...]. Nevertheless, although the social
scientist cannot depend on the automatic operation of scientific channels to
give him an adequate hearing, or to confront him with the most important
ideas developed by others, he must become familiar with those communication
channels that are open, so that he can make good use of them.


4.2 RESEARCH DESIGN 
Problem and Design 

The research design constitutes the blueprint for the collection,
measurement, and analysis of data. [...] limited [...] combination of these?
Are the methods of data collection to be highly structured? Is an intensive
study of a small sample more effective than a less intensive study of a large
sample? Should the analysis be primarily quantitative or qualitative?

The attempt to formulate a research design very early in the development of
the investigation can aid the researcher in achieving a more focused approach.
In moving from problem definition to research design, the implications of the
general research goals must be outlined in order to make decisions on specific
procedures. The development of a research design, however, does not
necessarily move the researcher toward more highly structured techniques. The
question of structure is a separate one which may be evaluated in terms of
its potential contribution to the solution of the problem.

Unfortunately, the scientist sometimes discovers at the completion of a 
project that, although the results are quite interesting, they do not really answer 
the questions originally posed. Often the fault lies in a definition of the prob- 
lem so vague that it provides little guidance in the construction of the research 
design.

What the investigator sometimes does in order to make the problem definition
more explicit is to make up hypothetical data which might result from
a given problem definition and research design. He then examines these data
to determine whether or not they provide answers to the problem in which
he is genuinely interested. If not, he proceeds to redefine the problem and
alter the research design accordingly.

It seems useful to define the problem before considering questions of
research design. By allowing design considerations to influence problem
definition, the researcher tends to limit his choice to those problems about
which objective data can be most readily obtained. In some cases, problems of
great importance might be ignored because of measurement difficulties. Of
course, there are many practical considerations in each research project, but
if these subsequently modify the definition of the problem, at least the
scientist is fully aware of what it is that he is sacrificing.

Sampling 

The scientist works toward the development of propositions and theories
that have the widest possible scope, but he can only obtain a limited number 
of observations. Thus, in order to secure the best possible test for his theory, 
he must choose these observations wisely. He must decide (1) what kinds of 
observations to make, and (2) how many observations to make. 

One of the most powerful ideas the investigator has to work with is that of 
probability sampling, a topic which will be treated more fully in Section 15.1. 
By selecting his observations in accordance with the principles of probability 
sampling, the investigator is greatly aided in obtaining a set of observations 
(sample) that is representative of the much larger set of observations to
which his theory has applicability. The saving is immense. For example, 
a probability sample of only a few thousand Americans can provide an 
accurate basis for uncovering the opinions of the entire population. 
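
The saving can be made concrete with a small simulation. The sketch below is
an illustration only: the population of 200,000, the 40 percent share holding
the opinion, and the sample of 2,000 are hypothetical figures chosen for the
example, and the function name estimate_share is likewise invented here.

```python
import random


def estimate_share(population, sample_size, rng):
    """Estimate the share of 1s in the population from a simple random sample."""
    sample = rng.sample(population, sample_size)
    return sum(sample) / sample_size


rng = random.Random(1966)  # fixed seed so the run is repeatable

# Hypothetical population: 200,000 people, 40 percent of whom hold the
# opinion in question (coded 1); the rest are coded 0.
population = [1] * 80_000 + [0] * 120_000

estimate = estimate_share(population, 2_000, rng)
print(f"true share = 0.400, sample estimate = {estimate:.3f}")
```

Only one percent of the hypothetical population is observed, yet the estimate
should ordinarily fall within a few percentage points of the true share.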

Probability sampling refers to a particular method of selecting a sample or
a subset of units from a population or larger set. Each unit or element of the
population has a known probability of being included in the sample. Although
individual human beings frequently constitute these units, they may be any
entity in which the investigator is interested, e.g., families, organizations,
communities, societies, time periods, pages of a newspaper, and so on.

One common type of probability sampling is simple random sampling, in
which each unit or subset of units within the population has an equal
probability of being included in the sample. If, for example, numbered pieces
of paper in a hat correspond to the population, then a simple random sample
can be approximated by mixing up these pieces extremely well and then having
a blindfolded individual select the sample.³
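
The drawing from a hat can be imitated on a computer. The following is a
minimal sketch, assuming a hypothetical hat of 100 numbered slips and a sample
of 10; the helper name draw_simple_random_sample is invented for this
illustration. Python's random.sample draws without replacement, so no slip can
be chosen twice.

```python
import random


def draw_simple_random_sample(units, n, seed=None):
    """Draw n distinct units; every subset of size n is equally likely."""
    rng = random.Random(seed)
    return rng.sample(units, n)


# Hypothetical hat holding slips numbered 1 through 100.
slips = list(range(1, 101))

sample = draw_simple_random_sample(slips, 10, seed=7)
print(sorted(sample))
```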

The method of sample selection constitutes evidence within the context
of justification. Probability samples help the investigator to validate his
propositions widely and thus to assess their [...]. They have the advantage of
aiding the investigator to generalize, with a known degree of error, findings
based on his sample data to the population from which the sample was drawn.
As the size of the probability sample increases, this degree of error shrinks
rapidly.
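
How the degree of error depends on sample size can be sketched with the usual
textbook approximation for a sample proportion. The formula, the 95 percent
level (z = 1.96), and the worst-case value p = 0.5 are assumptions brought in
for this illustration rather than taken from the text, and both function names
are hypothetical. The approximation presumes simple random sampling from a
population much larger than the sample.

```python
import math


def margin_of_error(n, p=0.5, z=1.96):
    """Approximate 95 percent margin of error for a sample proportion."""
    return z * math.sqrt(p * (1 - p) / n)


def required_sample_size(margin, p=0.5, z=1.96):
    """Smallest n whose approximate margin of error does not exceed `margin`."""
    return math.ceil((z ** 2) * p * (1 - p) / margin ** 2)


# Each step quadruples the sample size.
for n in (100, 400, 1_600, 6_400):
    print(n, round(margin_of_error(n), 4))

# Sample size needed for a margin of one percentage point.
print(required_sample_size(0.01))
```

Under these assumptions each quadrupling of the sample halves the margin of
error, so the early gains in accuracy are large and later gains are costly.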

Decisions on the size and type of sample must take into account several
factors. One of these is the practical consideration that research resources
are limited. Although the investigator might decide that he would like to
estimate the characteristics of the population to within 1 percent of their
actual occurrence, he may not be able to afford a sample large enough to
yield this degree of accuracy. Another factor is the degree of heterogeneity
of the entities selected for investigation. The chemist need not be concerned
very much about variations in the chemical properties of an element or
compound from one sample to another, but human beings from different
cultures or subcultures differ very markedly. Whether this difference
constitutes a variation in the mechanisms of human behavior or merely in its
outward manifestations [...].

In coming to a decision, the investigator would do well to be guided very
closely by his definition of the problem. It might be tempting to spend
research resources on a probability sample of the entire population of
individuals in the [...], but the expenditure would be great because of the
high cost of probability samples. But, if the definition of the problem is
within the context of discovery, a superior strategy for a given problem might
be to collect detailed data on a non-probability sample. The money saved might
then be employed in other phases of the investigation. By the same token, a
detailed case study of one medical school may be more fruitful than a less
detailed study of all of the medical schools in the United States. One strategy
adopted by some researchers is to make use of large probability samples only
at the point at which propositions of considerable merit have been developed
and are ready for wider testing. The argument supporting this point of view
is that it is not important to be able to say something that is true for the
entire population of a given society, if that something is a trivial statement
which adds little to our ability to explain and predict human behavior.

³ More convenient for the researcher is a table of random numbers. Many
statistics textbooks contain such tables. For example, see The RAND
Corporation, A Million Random Digits with 100,000 Normal Deviates (New York:
The Free Press of Glencoe, 1955). A table issued to researchers free of charge
is the Interstate Commerce Commission's Table of 105,000 Random Decimal
Digits, Statement No. 4914, File No. 261-A-1 (Washington, D.C.: Bureau of
Transport Economics and Statistics, 1949).

4.3 THE OBSERVER AND THE OBSERVED 
The Observed 

The term observation, as used in this context, refers to the many different 
ways in which the investigator attempts to collect data about some reality. We 
may think of the investigator as the observer; the phenomena under study,
as the observed; and the effect of observer on observed (and vice versa), as
the interaction between the two. In attempting to understand the processes
whereby data about human behavior are collected by human beings, we must
apply whatever knowledge we have about how human beings behave. This is 
an example of the relationship between theory and method, because the best 
theory available is needed in order to understand the data-collecting process 
and maximize its effectiveness. 

Procedures for learning about reality, or human behavior as it actually 
occurs, are never certain. The scientist comes as close to the reality as possible 
by utilizing procedures of data collection and checking his findings with those 
of other members of the scientific community. The reality is thus always that 
which is seen through the eyes of the observer, and in this sense the distinc- 
tion between the observed and the observer is difficult to draw. 

A good deal of social science theory tells us that the concepts of social 
relationship, norms or expectations, and values or goals have great utility in 
helping to explain human action. The medical student study illustrated the 
role of value expectations in explaining preference for medical fields. An 
earlier phase of the study compared students who (in all likelihood) had
many social relationships among fellow students with students who had few 
such relationships. It was found that the former group exhibited a greater 
degree of agreement as to preference for medical field. 

Social relationships place the individual in a web of repeated interactions 
with other individuals, and this sets the stage for the mutual pressures or 
constraints among them. These constraints have specific content, referring to 
what should and should not be done. Those expectations which occur re- 
peatedly may be referred to as norms, and they may be viewed as messages 
that are sent and received and to which sanctions (rewards and punishments) 
are attached. These sanctions are effective insofar as they implicate the values 
of the individual to whom they are sent. Human action may or may not 
conform with normative expectations, depending on whether the sanctions 
support or interfere with the goals of the acting individual. 


The Observer 


We start with the obvious fact that the observer is a human being. But 
many of the social relationships, norms, values, and actions in which the 
scientist is involved are very different from those found in the lay situation. 
For instance, the scientist is a member of the scientific community in general 
as well as of various special scientific groups. The social relationships found 
in these groups carry with them norms of scientific procedure, and there are 
various mechanisms for enforcing conformity to these norms. Furthermore,
the scientific community sets up its own goals (e.g., the general goals of
explanation and prediction), which are instilled to a greater or lesser degree
in the individual scientist through the educational process as well as in the
context of his subsequent scientific activities. Finally, the scientist acts,
and his own logic-in-use or reconstructed logic may or may not conform to
prevailing reconstructed logics.

One of the values much emphasized by scientists is objectivity. Because
objective procedures are highly explicit, the entire scientific community can
participate in the validation of a given hypothesis or theory. Furthermore,
objective research procedures are of such a nature that they can be repeated
by other investigators. In this context, the judgment of a particular
psychoanalyst as to the mental health of a sample of respondents is much less
objective than a scale based on a series of items comprising a portion of
interview protocols from this sample of respondents. In all likelihood the
psychoanalyst could not communicate the very specific procedure he utilized in
sufficient detail to yield repetition by others. Furthermore, it is extremely
likely that, because the procedure is usually not perfectly explicit even for
the psychoanalyst, it may vary from one respondent to the next.

Nevertheless, although objectivity is quite valuable in the context of
justification, an orientation wherein only highly objective procedures are
utilized may hinder the scientist in the context of discovery. The way in which
various psychoanalysts conceive of mental health and mental illness may be
highly suggestive to the social scientist, and to ignore it simply because of
its subjective nature is to make objectivity an end in itself. It is important
to test propositions, but it is also important to develop hypotheses and
theories. Both are needed for the development of scientific explanations and
predictions.

Another important value within the scientific community is skepticism.
A skeptical attitude makes sense if we bear in mind the severe limitation of
research resources that usually exists. There are many things that might be
done by the scientist but only some of them can be done. Skepticism enables
the scientist to make a thorough assessment of ideas and findings before
investing his limited resources in them.

Interaction Between Observer and Observed 

The Heisenberg principle of indeterminacy illustrates the effect of the
observer on the observed in quantum physics. The process of observation
necessitates the bombardment of the observed particles with photons of
light, thus changing their position and velocity. The position and velocity of
the particles prior to such bombardment thus remains unknown.

The situation is not quite the same in the observation of human behavior, 
for the social scientist can hide behind one-way mirrors for some situations, or 
he may enter certain situations as a participant and disguise the fact that he is 
an observer. Even in these instances, however, there is interaction between 
observer and observed, for the phenomena observed affect observers differently, 
and scientists may try to read into their observations some degree of agree- 
ment with their colleagues. 

Most research situations in the social sciences are not of the one-way
mirror type. The interviewer or experimenter or observer brings his own set of
constraints which apply to the individuals under observation. Very little is
known about the nature of these constraints, for some of them are extremely
subtle. A middle-class interviewer of a working-class respondent may have a
carefully developed technique for asking questions and reacting to answers
that does not in the least betray his own values and expectations. Yet, the
fact that he is a middle-class interviewer can hardly be hidden from most
respondents, who would be alert to such cues as type of dress, manner of
speech, and the very fact that interviewing is not a working-class occupation.
If, to avoid some of these constraints, interviewers of working-class
individuals were selected from among working-class applicants, a somewhat
different set of constraints would emerge. Some feelings might more readily be
divulged to a social equal; others (e.g., any ideas which might be interpreted
as pro-middle-class or anti-working-class) would be more carefully hidden
under these circumstances.

Because the observer can control his interaction with the observed only to 
a limited extent, it behooves him to take those measurements which will enable 
him to correct for the interaction. For example, the middle-class interviewer
might ask the respondent what he thinks the interviewer’s expectations are. 
Efforts to develop objective observational techniques, and the replication of 
observations by other social scientists, achieve some degree of control with 
respect to the effect of the observed on the observer. 

As for the effect of the observer on the observed, what seems to be needed 
is the collection of data on this effect as well as the other phenomena in 
which the investigator is interested. The fact that the respondent’s behavior 
will be influenced by the observer is a research problem in and of itself.
Explanations of such effects both add to our overall knowledge of human 
behavior and contribute to our ability to correct for this interaction in research 
situations. 


4.4 DEGREE OF STRUCTURE 
The Uneasy Compromise 

In most of the behavioral sciences there seems to exist an uneasy
compromise between those who conceive of research as a highly structured,
objective, quantitative, and rigorous affair and those who are more
qualitatively oriented and less concerned with rigorous proof. Claims of
superiority are made by advocates of both methods, but trivial or fruitful
results may be the outcome of either.

The controversy seems to be one aspect of the division between the "two
cultures": the sciences and the humanities. Some historians, for example,
treat their discipline as one of the humanities, whereas others place it among
the sciences. Some economists rely heavily on Aristotelian logic as well as
introspection, whereas others are the model builders, econometricians, and
reconstructionists. Some psychologists would exclude introspection on the
part of the investigator while others regard introspective data as admissible.

Within sociology the controversy is best viewed in historical perspective.
Hinkle and Hinkle⁵ discuss the early period of American sociology (1905-
1918) as one characterized by social reformism, faith in progressive
change, a belief in the existence of natural laws governing human behavior,
and the general quest to make sociology scientific. An important controversy
centered around William I. Thomas' and Florian Znaniecki's The Polish
Peasant in Europe and America, which made extensive use of personal
documents, such as letters and autobiographies, in order to construct
detailed "life-histories" or case studies. Many sociologists felt that this
method provided an excellent opportunity to convey a thorough account of a
pattern of life, and, in particular, man's inner mental life. A number of
quantitatively or statistically oriented sociologists attacked the method on
the grounds that the citation of examples does not constitute scientific proof
and that the method is highly subjective, especially because the investigator
can select examples to suit his own purposes.

This controversy within sociology has been resolved to a certain extent, with
most sociologists agreeing that both case studies and statistics contribute
to the scientific process. The advantages of case studies lie not so much in
the context of justification as in the context of discovery. Quantitative
methods are considered especially important in the context of justification.
Actually, the compromise is an uneasy one. The controversy between the
quantitatively oriented, who believe in highly structured techniques, and the
qualitatively oriented, who believe in less structured techniques, does not
generally manifest itself by way of formal or informal debate; rather, it is
evidenced by research methods used. Actually this is far more consequential
than mere debate, for what sociologists do usually matters more than what
they say.

⁵ Roscoe C. Hinkle, Jr. and Gisela J. Hinkle, The Development of Modern
Sociology (New York: Doubleday & Company, 1954).

Levels of Abstraction 

Both less-structured and more-structured methods seem to have played
important roles in the history of the sciences generally. Within one general
pattern of research, investigation may move back and forth between the two.
In the medical student study, for example, the first phase was largely
exploratory in nature while the second phase was fairly structured. This
movement between methods of data collection is analogous to the movement of
investigators up and down the ladder of abstraction in order to develop
systematic theory and validate it.

Abstraction is, by definition, a selection of only some elements from among
the large set of elements under observation. When the survey researcher, for
example, asks an individual to react in one of several possible ways to a given
hypothetical situation, he is limiting his focus to the very few
characteristics which he includes in the hypothetical situation and the very
few response categories. This abstraction process is utilized by all
investigators to some degree, because the very language in which we think
forces us to select some phenomena over others and to pattern them in certain
ways. Thus the contrast is drawn between more structured and less structured
methods, and not between structured and unstructured methods. There does
remain a difference in degree of abstraction, however. In the more structured
approach, the investigator comes to the data-collection situation with
consciously formulated ideas which may then be tested and further developed.
In the less structured approach, the researcher often uses the research
situation as a means of focusing his initial ideas, which may be vague and not
consciously formulated.

To the degree that the compromise between the two methods is an uneasy 
one, investigators will experience difficulty in moving back and forth between 
them. To rely on structured methods exclusively is to close down many
avenues which might have been used for the best possible development of a
theory. A dependence on less structured methods may mean failure to test 
the theory adequately. 

The advantage that highly structured methods have within the context of
justification has to do with their objective nature. The procedures of
observation involved are relatively explicit and repeatable by different
observers. In addition, where observational procedures are explicit it is not
difficult for the investigator to measure the impact of such procedures on the
individuals [...]

living organisms. He may start with simple organisms and later move on to
more complex ones. Thus he creates a chain of research situations and utilizes
the knowledge gained in the situations with more closure to attack the more
open, "real" environment. If he can develop an effective theory in a closed
situation, the researcher is in a better position to move on to more complex
situations.

Movement from situations with more closure to those with less is certainly 
not the only direction research can and does take. The microbiologist can
develop ideas from his in vivo research that can be further explored in the
in vitro situation. In the social sciences, it is often not possible to set up a
highly controlled situation without having made initial field studies. The his- 
tory of science seems to suggest that investigators might do well to move back 
and forth among research situations of varying degrees of structure. Such pro- 
cedures, when based on fruitful theory, may well make it possible to attain 
closure, even in the more complex situations. 

SELECTED REFERENCES FOR FURTHER STUDY 

George Caspar Homans, Social Behavior: Its Elementary Forms (New York:
Harcourt, Brace & World, 1961). The data that form the raw materials for
Homans' theory are the empirical propositions he developed in The Human
Group, as well as many other studies that have appeared since World War II.
He attempts to explain these findings by extrapolating general propositions
from economics and behavioral psychology.

Matilda White Riley, Sociological Research: A Case Approach (New York:
Harcourt, Brace & World, 1963). A variety of well-known studies is used to
illustrate different approaches to data collection. Detailed commentaries on
each of the studies are presented, and they are also viewed in relation to one
another.

Ragnar Rommetveit, Social Norms and Roles (Minneapolis: University of
Minnesota Press, 1954). In the first part of this monograph, the author
develops his own role theory and relates it to the significant literature on
this topic; in the second part, he presents the results of various partial
tests of the theory. In sum, we have an illustration of the way in which data
collection is guided by a previously developed theory.



The Experiment 


5.1 INTRODUCTION 

The experiment, a method for the collection and analysis of data, has con- 
tributed so vastly to developments in the physical and biological sciences that 
it is widely thought of as the foundation for the scientific method. Within the 
social sciences it has been utilized chiefly by psychologists, but there are
indications that it is coming into more widespread use within sociology and other
social sciences. 

In common with other methods for collecting and analyzing data, the
experiment is designed to aid in the development and testing of propositions
and theories. Most of these have to do directly or indirectly with
cause-and-effect relationships, and thus it is the task of the researcher to
collect the kind of data which provides evidence for the existence of such
relationships. There are various types of evidence that can be advanced for or
against the existence of a given causal relationship. A number of these were
categorized by John Stuart Mill as the method of agreement, the method of
difference, the method of concomitant variation, and the method of residues.
Although Mill called these "the experimental methods," one or more of them
are actually utilized in modern form in all scientific methods for the
collection and analysis of data. After a discussion of Mill's methods, we
shall be in a position to understand the nature of the experiment as
distinguished from other methods.

5.2 MILL’S EXPERIMENTAL METHODS 
Method of Agreement 

Mill describes the method of agreement as follows: 

If two or more instances of the phenomenon under investigation have only
one circumstance in common, the circumstance in which alone all the
instances agree is the cause of the given phenomenon.¹

¹ Quoted in Morris R. Cohen and Ernest Nagel, An Introduction to Logic and
Scientific Method (New York: Harcourt, Brace & World, 1934), p. 251. From John

Let us suppose that the phenomenon under investigation is the plan of
parents to send their children to college. Let us also assume that the only
circumstance which a number of parents planning to send their children to
college have in common is the fact that they saw a film that stressed the
importance of a college education. Then, according to the method of agreement,
we may infer that exposure to the film caused the parents to plan to send
their children to college.

There are a number of things wrong with this argument. First, it is difficult
to imagine a situation in which individuals have only one thing in common.
The fact that they all are human beings, speak English, live on the planet
Earth, or live in the twentieth century would constitute other characteristics
in common. Is it possible to find a situation in which individuals have only
one thing in common? Second, a causal statement cannot be proved with any
certainty if it is a statement about reality. We may obtain evidence for a
given cause-and-effect relationship, but we cannot say definitively that any
given factor is the cause of any other factor.

In spite of these inadequacies, however, the method of agreement can be a
valuable tool in the analysis of cause-and-effect relationships, for it can be
used as a mechanism for obtaining evidence about such a relationship.

For example, what is commonly called the descriptive study follows the
rationale of the method of agreement. Whatever the investigator wishes to
describe corresponds to the "phenomenon under investigation" (e.g., medical
students who are interested in public health as a career). He may then attempt
to describe the characteristics that a number of medical students with this
interest ("two or more instances of the phenomenon") have in common. The
approach need not be to accept as evidence situations in which they have only
one factor in common, for this will never occur. Thus, for example, these
students, aside from job security, might be found to be very interested in
helping people. The investigator does not then infer that these goals alone
cause the students to be interested in public health; rather, he may treat
such data as partial evidence for the existence of a cause-and-effect
relationship. Of course, he may discover that no such relationship actually
exists.
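
The rationale of the descriptive study can be sketched as a search for the
circumstances shared by every instance. In the illustration below the three
students and their characteristics are hypothetical, and the function name
common_circumstances is invented for the example. The result is treated, as in
the text, only as partial evidence and not as proof of a cause.

```python
def common_circumstances(instances):
    """Return the circumstances shared by every instance of the phenomenon."""
    sets = [set(circumstances) for circumstances in instances]
    return set.intersection(*sets) if sets else set()


# Hypothetical medical students, all interested in public health as a career.
students = [
    {"values job security", "wants to help people", "urban background"},
    {"values job security", "wants to help people", "rural background"},
    {"values job security", "wants to help people", "physician parent"},
]

shared = common_circumstances(students)
print(sorted(shared))  # ['values job security', 'wants to help people']
```

Note that more than one circumstance survives the comparison, which is the
usual case and the reason the method cannot single out "the" cause.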

Method of Difference 

The method of difference is stated in this manner: 

If an instance in which the phenomenon under investigation occurs, and
an instance in which it does not occur, have every circumstance in common
save one, that one occurring in the former, the circumstance in which alone
the two instances differ is the effect, or the cause, or an indispensable part
of the cause, of the phenomenon.²

Stuart Mill, A System of Logic (New York and London: Harper and Bros.,
1890).

² Cohen and Nagel, op. cit., p. 256.

Suppose that the plan of parents to send their children to college is the 
phenomenon under investigation (as in the above example). We may then 
compare a group of parents exposed to the film on the importance of college
with one that is not (a circumstance the two groups do not have in common). 
Let us assume that the only difference between these groups is whether or not 
they were exposed to the film. Then, according to the method of difference, 
the decision to send children to college is either a cause or an effect of seeing 
the film. Because the decision is made subsequent to the showing of the film, 
the conclusion (following Mill) is that the decision is the effect of seeing the 
film. As in the method of agreement, there are shortcomings to the reasoning 
involved here: the “proof” of the existence of a cause-and-effect relationship 
lies in limited empirical data, and it is difficult or impossible to locate phe- 
nomena that have one and only one difference other than that under investi- 
gation. 
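
The comparison called for by the method of difference can be sketched as
follows. The two groups and their outcomes are hypothetical data made up for
the illustration, and the sketch simply assumes, as the example above does,
that the groups are alike in every respect other than exposure to the film.

```python
def share_planning_college(group):
    """Proportion of parents in the group who plan to send children to college."""
    return sum(group) / len(group)


# Hypothetical outcomes: 1 = plans to send children to college, 0 = does not.
exposed = [1, 1, 1, 0, 1, 1, 0, 1, 1, 1]      # saw the film
unexposed = [1, 0, 0, 1, 0, 1, 0, 0, 1, 0]    # did not see the film

difference = share_planning_college(exposed) - share_planning_college(unexposed)
print(round(difference, 2))  # 0.4
```

A difference of this kind is evidence for an effect of the film only to the
extent that the assumption of otherwise identical groups is justified.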

Method of Concomitant Variations 

Mill's method of concomitant variations is stated in this way: 

Whatever phenomenon varies in any manner, whenever another phenomenon
varies in some particular manner, is either a cause or an effect of that
phenomenon, or is connected with it through some fact of causation.³

One advantage of this method is that it moves the focus of an investigation 
from qualitative hypotheses about the existence of relationships to quantitative 
hypotheses about the degree of relationships. 

In an experimental context, we might be concerned about the relationship 
between the number of minutes of exposure to the film and the degree to 
'vhtch parents changed their attitudes on the importance of a college educa- 
tion. Thus, a number of different groups could be set up with each one repre- 
senting a different degree of exposure to filmed material. TTie investigation 
would seek to determine whether the greater the time of exposure of a given 
group to the film, the greater the change in parental attitudes. If this were true, 
then according to Mill’s formulation, exposure and attitude change would be 
causally related. The shortcoming in this reasoning lies, as in his other
methods, in the lack of certainty of such a “proof.” Although it does provide 
some evidence, any such association may be pure coincidence. 
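
The degree of relationship sought by the method of concomitant variations can be expressed as a correlation coefficient. The following is a minimal sketch only: the exposure times and attitude-change scores are hypothetical, and the Pearson coefficient is one common measure of association chosen here for illustration, not a procedure prescribed by Mill.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson product-moment correlation between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    covariation = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    variation_x = sum((x - mean_x) ** 2 for x in xs)
    variation_y = sum((y - mean_y) ** 2 for y in ys)
    return covariation / sqrt(variation_x * variation_y)


# Hypothetical data: one entry per group of parents.
exposure_minutes = [0, 10, 20, 30, 40]       # minutes of film shown to each group
attitude_change = [1.0, 2.5, 3.0, 4.5, 5.0]  # mean attitude change in each group

r = pearson_r(exposure_minutes, attitude_change)
print(f"correlation between exposure and attitude change: {r:.3f}")
```

A coefficient near 1 would indicate that greater exposure goes with greater change. As the text notes, such an association is evidence and not proof of a causal relationship.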

Method of Residues 

Mill’s method of residues is stated as follows: 

Subduct from any phenomenon such part as is known by previous induc-
tions to be the effect of certain antecedents, and the residue of the phenom-
enon is the effect of the remaining antecedents.†

* Cohen and Nagel, op. cit., pp. 261-262.

† Cohen and Nagel, op. cit., p. 264.



Using the example of the film on college once more, let us assume that we
have two groups of parents who differ only with respect to whether or not they 
were exposed to the film. Let us also assume that, during the course of the 
experiment, both groups happened to read news stories about the greater 
earning capacities of college graduates. Finally, let us assume that both groups 
became more positive to a certain extent about college for their children as a 
result of the news stories, although this change is mixed with the effects of the 
film for the exposed group. Then, according to the method of residues, we are 
in a position to calculate the effect of the film on the group exposed to it.

This is accomplished by conceiving of the total change in the exposed group
(T) as being made up of two parts: that which results from having read the
news stories (N) and that resulting from exposure to the film (F). We can
measure the change in the unexposed group which resulted from the news
stories, and since this group has been equated with the exposed group, we
may assume that this change corresponds to N for the exposed group. We can
also measure the total change in the exposed group (T). In order to calculate
the effect of the film, then, we perform the following subtraction:

F = T - N

As we shall see in Section 5.4, “interaction effects” may lead to a much more
complicated situation than Mill formulated, although his procedure would be
accurate in many situations.
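
The subtraction F = T - N can be written out as a short calculation. This is a sketch under stated assumptions: the function name and the two change scores are hypothetical, and no interaction between the news stories and the film is assumed.

```python
def film_effect(total_change_exposed, change_unexposed):
    """Method of residues: F = T - N.

    total_change_exposed -- T, the total change in the group that saw the film
    change_unexposed     -- N, the change in the equated group that did not
    """
    return total_change_exposed - change_unexposed


# Hypothetical change scores (e.g., percentage-point shifts toward college plans).
T = 25.0  # exposed group: news stories plus film
N = 10.0  # unexposed group: news stories only

F = film_effect(T, N)
print(f"estimated effect of the film: {F}")
```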

5.3 THE NATURE OF THE EXPERIMENT 

The nature of the experiment can be most clearly seen by examining the 
illustration used for Mill's method of difference. One group of parents was 
exposed to the film whereas the other was not, and the change in the college 
plans of the exposed group (assuming that the unexposed group experienced 
no change) was then attributed to their exposure. A defining characteristic of 
the experiment has to do with whether or not the investigator can assign the 
presumed causal or independent variable (exposure or lack of exposure) to 
the different groups. In this example such assignment does in fact take place, 
with one group being shown the film and the other group not seeing it. The 
investigator thus plays a very active role in the experiment, manipulating the 
experimental conditions in a certain way. 

While variables such as exposure to a film can be easily manipulated, this 
is not the case for such variables as age and sex. The investigator cannot 
assign age or sex to any group. Thus, when such variables are conceived of 
as causal or independent, other techniques of investigation than that of the
experiment are generally used. The exception to this rule occurs where the
investigator constructs a concept which, in fact, can be assigned, that is cog- 
nate to the non-manipulable one. For example, an investigator interested in the 
impact of age on certain factors might be able to study this indirectly by 
manipulating an individual's “subjective age” — i.e., how old the individual 
feels as distinct from how old he is chronologically. 

If the defining characteristic of the experiment is the assignment by the 
investigator of the experimental conditions, the major resulting characteristic 
is a high degree of control over various types of error. This is generally 
achieved by the assigning of conditions so as to secure an approximation to 
the kinds of situations discussed by Mill. In the method of difference, for ex- 
ample, the instances investigated have every circumstance in common save 
one. Through his assignment of conditions, the experimenter can approximate 
the situation where two groups differ in one respect only — e.g., seeing or not 
seeing a film. In this way, all other factors are controlled, and he is then able 
to attribute any resulting difference between the two groups to the impact of 
the film. The specific techniques used by the experimenter to achieve such 
control come under the general heading of randomization and matching.

Randomization 

One of the most important controls involved in experimental procedures 
is the technique of randomization. Randomization has to do with the process 
by which subjects (or whatever other units are being investigated) are assigned 
to the various experimental procedures. Let us assume we are testing the 
effectiveness of a given communication in changing the values of the subjects. 
The classical experimental design provides for an experimental group, which 
is subjected to one treatment (in this case, exposure to the communication), 
and a control group, which is not. 

Randomization, in this context, is a procedure for assigning available sub- 
jects to the experimental and control groups in such a way that the two groups 
will be quite similar. It is important to achieve this similarity, for then
differential changes in the values of subjects in the two groups may be more
readily associated with the different treatments involved, rather than with the
pre-existing differences between the two groups. A procedure that might be used
in this context is the assignment of a random sample of subjects participating
in the experiment to the experimental group and a random sample of the same
size to the control group. In this manner each of the two groups becomes a
random sample of the initial pool of available subjects, for each subject in
the pool had an equal chance of being placed in either group. Thus each of 
the two groups can be treated as being representative (within an estimated 
degree of error) of the initial pool of subjects, and consequently the two 
groups will tend to be quite similar because “things equal to the same thing 
are equal to each other.” 
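
The assignment procedure just described might be sketched as follows. The function name, the pool of 120 numbered subjects, and the fixed seed are assumptions made for illustration. Shuffling the pool and splitting it in half is one simple way of giving each subject an equal chance of being placed in either group.

```python
import random


def randomize(subjects, seed=None):
    """Randomly split a pool of subjects into two groups of equal size.

    If the pool has an odd number of subjects, one subject is left unassigned
    so that the two groups stay the same size.
    """
    rng = random.Random(seed)  # a seed makes the assignment reproducible
    shuffled = list(subjects)
    rng.shuffle(shuffled)
    half = len(shuffled) // 2
    return shuffled[:half], shuffled[half:2 * half]


# Hypothetical pool of 120 subjects identified by number.
pool = range(120)
experimental, control = randomize(pool, seed=42)
print(len(experimental), len(control))
```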

The significance of the randomization procedure is far-reaching, for it 
achieves a matching of the two groups on all variables, with an estimated 
degree of error. This degree of error can be reduced by increasing the size
of the experimental and control groups. (Of course, no matter how large the
groups are, there is always a finite probability that the groups will differ by a
given degree.)
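
The claim that the degree of error shrinks as the groups grow can be checked with a small simulation. This is an illustrative sketch only: the trait carried by exactly half of the pool, the number of trials, and the group sizes are all hypothetical choices.

```python
import random


def mean_group_gap(group_size, trials=200, seed=0):
    """Average absolute difference between two randomized groups in the
    proportion of subjects who have some trait.

    The pool holds 2 * group_size subjects, exactly half of whom have the
    trait (coded 1) and half of whom do not (coded 0).
    """
    rng = random.Random(seed)
    pool = [1] * group_size + [0] * group_size
    total_gap = 0.0
    for _ in range(trials):
        rng.shuffle(pool)
        experimental = pool[:group_size]
        control = pool[group_size:]
        total_gap += abs(sum(experimental) - sum(control)) / group_size
    return total_gap / trials


for size in (10, 100, 1000):
    print(size, round(mean_group_gap(size), 3))
```

The average gap falls as the groups grow, although for any group size some individual randomizations will still produce noticeably different groups.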

Frequency Distribution and Precision Control 

Randomization procedures may be contrasted with matching procedures, 
which attempt to equate the groups only on a limited number of variables. 
One type of matching procedure commonly used is frequency-distribution
control, which involves equating the experimental and the control groups on
whatever variables are deemed to be important, e.g., those factors that might
influence the effectiveness of the given communication in changing the
subjects’ values. If frequency-distribution control is achieved with respect to sex,
for example, then both groups will contain the same proportion of each sex. 
Only a limited number of factors can be subjected to such control, however, 
and each of them must be identifiable and measurable. Randomization, on 
the other hand, achieves control (within an estimated degree of error) over 
factors that have not yet been discovered. 

Precision control is another matching technique that involves pairing
individuals on the basis of several factors and then placing one member of each
pair in the experimental group and the other in the control group. The
advantage of this technique over frequency-distribution control is the matching
of the joint distribution of several factors, rather than of the distribution of
single factors. Thus, frequency-distribution control based on sex and social
class alone might lead to a much higher proportion of middle-class males in
the experimental group than in the control group. With precision control,
however, the two groups would contain the same proportion of middle-class
males. Precision control has a disadvantage, however: because it is difficult
to achieve this kind of pairing, a large proportion of potential subjects from
the initial pool will remain unpaired and consequently will not be utilized in
the experiment. This loss of cases will be especially great if the attempt is
made to achieve precision control on more than a very few factors.

Important differences among precision control, frequency-distribution con-
trol, and randomization procedures are illustrated in Table 5-1. Let us as-
sume that we are investigating the effect of a lecture on attitudes about the
desirability of having female candidates for the presidency, and that we pro-
ceed by lecturing to an experimental group and not lecturing to a control
group. Given this problem, it would seem to be desirable to control both on 
sex and on social class, since we know from prior research that each of these 
variables is quite relevant to such attitudes. 

When we control on variables within this context we attempt to make sure
that the experimental group and the control group are as close to identical
as possible with respect to the distribution of such variables. In this way,
whatever influence such variables have on the experiment will make its
impact felt within the control group as well as the experimental group, and
the experimenter will thus be able to account for it.

TABLE 5-1
A COMPARISON OF PRECISION CONTROL, FREQUENCY-DISTRIBUTION CONTROL,
AND RANDOMIZATION PROCEDURES FOR HYPOTHETICAL DATA ON 120 SUBJECTS

                                            Experimental Group    Control Group
                                                 (N = 60)            (N = 60)
Control Procedure               Class        Males   Females     Males   Females

Precision control               Middle         10       10         10       10
                                Working        20       20         20       20

Frequency-distribution control  Middle          5       15         15        5
                                Working        25       15         15       25

Randomization                   Middle         12        8          8       12
                                Working        18       22         22       18

In comparing the result of precision control and frequency-distribution con-
trol techniques on the data, we can see from Table 5-1 that both techniques
have two end results in common: (1) they both yield an experimental group
with 30 males, 30 females, 20 middle-class individuals, and 40 working-class
individuals; and (2) they both yield a control group with 30 males, 30 fe-
males, 20 middle-class individuals, and 40 working-class individuals. Thus,
seemingly both techniques are equally effective in constructing an experi-
mental group that has the same distribution with respect to these two vari-
ables as the control group.

But we can see that there are also differences resulting from these two
techniques, and these may be summarized as follows:

1. Precision control produces 10 middle-class males in both the experimental
and the control group, whereas frequency-distribution control produces
5 middle-class males in the experimental group and 15 middle-class males
in the control group.

2. Precision control produces 10 middle-class females in each of the two
groups, while frequency-distribution control yields 15 in the experimental
group and 5 in the control group.

3. Precision control yields 20 working-class males and 20 working-class
females in each of the two groups. Frequency-distribution control produces
25 working-class males and 15 working-class females in the experimental
group, and 15 working-class males and 25 working-class females in the
control group.
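
The contrast between matching single distributions and matching the joint distribution can be checked directly. The sketch below enters the cell counts of Table 5-1 for the two matching procedures; the variable and function names are illustrative only.

```python
# Cell counts from Table 5-1, keyed by (social class, sex).
PRECISION = {
    "experimental": {("middle", "male"): 10, ("middle", "female"): 10,
                     ("working", "male"): 20, ("working", "female"): 20},
    "control":      {("middle", "male"): 10, ("middle", "female"): 10,
                     ("working", "male"): 20, ("working", "female"): 20},
}
FREQUENCY = {
    "experimental": {("middle", "male"): 5, ("middle", "female"): 15,
                     ("working", "male"): 25, ("working", "female"): 15},
    "control":      {("middle", "male"): 15, ("middle", "female"): 5,
                     ("working", "male"): 15, ("working", "female"): 25},
}


def marginals(cells):
    """Collapse joint cell counts into totals for each class and each sex."""
    totals = {}
    for (social_class, sex), count in cells.items():
        totals[social_class] = totals.get(social_class, 0) + count
        totals[sex] = totals.get(sex, 0) + count
    return totals


def same_marginals(design):
    """True if both groups have the same distribution on each single factor."""
    return marginals(design["experimental"]) == marginals(design["control"])


def same_joint(design):
    """True if both groups have the same count in every class-by-sex cell."""
    return design["experimental"] == design["control"]


for name, design in (("precision", PRECISION),
                     ("frequency-distribution", FREQUENCY)):
    print(name, "| single factors match:", same_marginals(design),
          "| joint cells match:", same_joint(design))
```

Both procedures equate the groups on sex and on social class taken one at a time, but only precision control equates them on the combinations of the two.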

The difference between experimental and control groups produced by fre-
quency-distribution control techniques (as distinct from precision control)
may or may not “contaminate” the factor of hearing or not hearing the lecture
on female candidates for the presidency. The point is that the experimenter
cannot have confidence that any difference between the experimental
and the control group will have no effect on his interpretation of the
experimental data. Any such difference alters the pure situation in which
the only factor to which he can attribute a difference between these two
groups at the close of the experiment is whether or not a group heard the
lecture.

From Table 5-1 alone, it would seem that randomization is superior to
frequency-distribution control but inferior to precision control. This is because
there is some variation, as between the experimental and control group, on
the proportion of middle-class males, middle-class females, working-class
males, and working-class females. Such variation is less than that occurring
as a result of frequency-distribution control but greater than for precision
control. Actually, there are several other characteristics not shown in Table
5-1 that make randomization generally superior to precision control:

1. Precision control can only be exercised on a very few variables, whereas
randomization achieves control over all variables that are related to
whatever is randomized. Thus, for example, if subjects are assigned at
random to the experimental group and the control group, then all variables
having to do with individual differences are controlled. The degree of
control is not perfect, as can be seen from the example, but it increases
as the size of the groups increases.

2. Randomization procedures utilize all of the subjects from the initial pool
available for the experiment, but many cases are lost with precision
control, the number lost increasing as each additional variable is
controlled. This is especially serious in the situation where the initial pool
of subjects is representative of some larger population. As cases are lost
from this pool, it becomes less and less representative of the population,
and the experimenter is no longer able to generalize the results of his
experiment to the population with any degree of confidence.




Frequency-distribution control or precision control techniques can be com-
bined with randomization procedures. Such a combination might be advan-
tageous when the experimental and control groups are fairly small and one
or more factors are known to exert an important influence on the phenomena
under investigation. The advantage here is that randomization would serve to
control all variables relating to individual differences (if subjects were
randomized) that were not controlled by frequency-distribution or precision
control procedures.

Randomization and the procedures of frequency and precision control are
ways of preventing the interference of extraneous factors. We need not think
of them only in the context of the assignment of subjects to experimental and
control groups, for we can also utilize these procedures in the actual setting
up of the experimental- and control-group situations in such a way that they
are as nearly equivalent as possible. It should be borne in mind that thus far
all of our examples have had to do with randomization procedures as applied
to subjects. They have wider implications, however.

The experimenter, for example, will generally attempt to match the two
groups on the various aspects of the physical situation under which the experi-
ment takes place. If this matching is achieved by utilizing the same room for
both groups, thus leading to a difference in the time of day at which the data
are collected, he might decide to control the time factor by a randomization
procedure: he might use two experimental and two control groups, with one
experimental group randomly selected for an early time on one day and one
control group randomly selected for an early time on another day.

Thus, the experimenter tries to equate the experimental- and control-group
situations by matching and randomization so as to avoid error in drawing
conclusions about the relationship between the independent and dependent
variables. Without such matching, the experimenter might conclude that his
different treatments of the experimental and control groups led to an observed
difference in their behavior, when actually the observed difference arose from
a difference in the situations (e.g., time of day).

There are other procedures for achieving control. Where differences be-
tween experimental and control groups cannot be eliminated, it is often
feasible to take them into account through analytical procedures. One such
technique is that of “controlling on a third variable,” a procedure to be
discussed in Chapter 16.


5.4 EXPERIMENTAL DESIGN 


The Classical Design 

In order to understand the process of experimentation, it is most helpful to
examine various sources of error as well as the way in which controls
work toward the elimination of these errors. The general function of the
control group in the classical design is to create a situation as nearly
equivalent to the experimental-group situation as possible. In this way, any
resulting differences between the two groups may be attributed to the different
treatments accorded to the two groups by the experimenter. A good deal of error
often stems from the fact that it is not possible to set up groups that “have
every circumstance in common” except the one under investigation (as in the
method of difference). The experimenter has to settle for less, and one
procedure is to utilize randomization and matching techniques to come as close
to the ideal as possible.

Assuming that the two groups are essentially equivalent, we may proceed
to trace the impact of a variety of factors on the two groups. It would be
useful to view this experiment, which may be called the before-after experiment
with one control group, with reference to a series of stages. These may be
presented schematically as follows:


                                                            Experimental   Control
Stages                                                         Group        Group

1. Selection of subjects: matching and/or randomization         Yes          Yes
2. Experimental conditions: matching and/or randomization       Yes          Yes
3. “Before” measurement (pretest)                               Yes          Yes
4. Exposure to test stimuli                                     Yes          No
5. “After” measurement (posttest)                               Yes          Yes

We have already discussed the way in which matching and randomization
techniques operate to increase the similarity of the two groups. This is vital,
for if the two groups differ on any factor related to the variables under in-
vestigation, erroneous conclusions may easily be drawn. Experimental condi-
tions refer to all the stimuli having to do with the experiment once the subjects
have been selected and prior to the “before” measurement (the physical con-
ditions involved in the experiment) and any interaction between experimenter
and subjects.

In this case, the “before” measurement constitutes an index of the initial
attitudes of the parents on the subject of sending their children to college. The
test stimuli would then be the film which is subsequently shown to the experi-
mental group and withheld from the control group. The “after” measurement
is an index of the subsequent attitudes of parents on the importance of a
college education for their children. The difference between the “before” and
“after” measurements constitutes the change in attitude. When the change for
the control group is subtracted from that for the experimental group, we have
a measure of the effect of the film itself. The rationale here stems directly
from the method of residues.
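
The calculation for the classical design can be laid out step by step. The attitude scores below are hypothetical, and averaging the scores within each group is an assumption made for this sketch; the text itself specifies only that the control group's change is subtracted from the experimental group's change.

```python
from statistics import mean


def mean_change(before, after):
    """Change in a group's average score from pretest to posttest."""
    return mean(after) - mean(before)


def stimulus_effect(exp_before, exp_after, ctl_before, ctl_after):
    """Experimental-group change minus control-group change."""
    return mean_change(exp_before, exp_after) - mean_change(ctl_before, ctl_after)


# Hypothetical attitude scores for four parents in each group.
exp_before = [3, 4, 5, 4]
exp_after = [6, 7, 8, 7]
ctl_before = [3, 5, 4, 4]
ctl_after = [4, 6, 5, 5]

effect = stimulus_effect(exp_before, exp_after, ctl_before, ctl_after)
print(f"estimated effect of the film: {effect}")
```

Here the experimental group changes by 3 points and the control group by 1 point, so 2 points are attributed to the film.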

The method of residues constitutes a procedure for taking into account
those factors other than the test stimuli that affect the dependent variable.
These may be divided somewhat arbitrarily into three groups: internal fac-
tors, or those originating in the subjects themselves; experimenter factors, or
those originating in the stimuli sent out by the experimenter; and external
factors, or those originating outside the experimental situation.

An example of an internal factor might be a maturational process within 
the individual. This factor might be especially relevant if the experiment were 
conducted over a long period of time. If there were no control group, a change 
in the experimental group might appear to arise from the test factor, whereas
it might actually be a result of maturational processes. Here we see the value 
of the control group. This error is avoided by utilizing the method of residues 
and subtracting the change in the control group from that in the experimental 
group. 

The “before” measure may alert the subject to the desire of the experi-
menter to influence his attitude with respect to a college education for his
children (an experimenter factor). The subject may react to this positively
or negatively. If, for example, he tries to give responses in the posttest which
he thinks are very socially acceptable to the experimenter, he will then exhibit
what appears to be a change in attitude as a result of seeing the film. If
a pretest and posttest measurement of a control group are taken, this error
in interpretation can be eliminated.

A potential source of external factors is magnified when the experiment
takes place over a long period of time. Occurrences reported in the mass
media of communication having to do with the importance of a college
education may change the attitudes of subjects between the pretest and the
posttest. This change might be erroneously ascribed to the efficiency of the test
factor unless a control group were subjected to the same external factors.
Any influence from external sources may be canceled by the method of
residues, i.e., by subtracting the change in the control group from that in the
experimental group.


Solomon’s Two-Control-Group Design 

Solomon has developed extensions of the classical design that involve more
than one control group.* One of these calls for one experimental group and
two control groups.

The advantage of this design becomes evident in situations where the
“before” measurement is expected to interact with the test stimuli. For exam-
ple, let us assume that the “before” measurement on plans to send one's
children to college only partially sensitizes the subject to the purposes of the
experiment, and that the same is true of exposure to the film. If the subject
experiences both the “before” measurement and the test stimuli, he may be
able to “put two and two together.” If awareness of the purposes of the
experiment significantly affects the “after” measurement, we have what is
known as an interaction effect. In this situation the effect of the “before”
measurement combined with the test stimuli is out of proportion to the effect
of the “before” measurement alone and the effect of the test stimuli alone.

* R. L. Solomon, “Extension of Control Group Design,” Psychological Bulletin,
XLVI, 137-150 (1949).


                                                            Experimental   Control   Control
Stages                                                         Group       Group 1   Group 2

1. Selection of subjects: matching and/or randomization         Yes          Yes       Yes
2. Experimental conditions: matching and/or randomization       Yes          Yes       Yes
3. “Before” measurement (pretest)                               Yes          Yes       No
4. Exposure to test stimuli                                     Yes          No        Yes
5. “After” measurement (posttest)                               Yes          Yes       Yes

Difference                                                       D           D'        D"

In this design the experimental group and the first control group are treated
as they would be in any study with one control group. The second control
group is given no pretest. However, we may estimate its initial
attitudes about sending one's children to college from the “before”
measurements of the experimental group and the first control group. This
procedure is justified because the three groups were set up by matching and/or
randomization, and thus they may be regarded as equivalent to a considerable
degree. Consequently a difference D" between the estimated pretest scores
and the posttest scores for the second control group may be obtained. Because
there is no possibility that D" reflects the effect of a pretest or of interaction
between a pretest and test stimuli (no pretest was involved), we may attribute
D" to the influence of the test stimuli alone.

The dilference D' between the pretest and ““’"V to the lack 

ttol group may be attributed to the influence of the I , of 

of exposure to test stimuli precludes the influence or “to the 

stimnli. The difference D between prefect and P“>® 
expeiimental fltonp, which experiences both a , j,touli. 

Stimuli, constitutes a measure of the influence of the pretest, t 
and the interaction between the two. 

If the experimenter’s interest is not limited to investigating the reaction
to test stimuli, he will want to examine the process whereby the pretest
measure has an impact on the subjects, by itself or in interaction with the
test stimuli. In this context the mechanisms involved in the data-collection
procedure are viewed as data about human behavior, not simply as tools for
testing other ideas. Already D' constitutes a measure of the influence of the
pretest alone, and it remains to estimate the magnitude of the interaction.
Solomon provides a formula for estimating interaction (I):

I = D - (D' + D").

This follows the method of residues, for we subtract the effects of the pretest
and the test stimuli from D (the measure of the effects of these two factors plus
that of interaction) to obtain a measure of interaction alone. Armed with
estimates of the influence of the pretest and that of interaction, we may then
proceed to investigate the source of these two effects.

The Solomon technique might be illustrated by hypothetical data on this 
example: 


                            Experimental      Control         Control
                               Group          Group 1         Group 2

Before measurement
(% with college plans)           20              20         20 (estimated)

After measurement
(% with college plans)           90              30              30

Difference                  D = 90 - 20     D' = 30 - 20    D" = 30 - 20
                              = 70             = 10            = 10


Here the percentage of parents with college plans for their children changes
greatly within the experimental group, i.e., from 20 percent to 90 percent.
This difference D results from the impact of the before measurement, the
exposure to the film, and the interaction between the two. For Control Group
1, the 10 percent change D' results only from the before measurement. For
Control Group 2 the 10 percent change D" results only from exposure to the
film (since no before measurement is taken). Interaction between the before
measurement and exposure to the film is calculated as follows:

I = D - (D' + D") = 70 - (10 + 10) = 50.

Here it can be seen that there is a large interaction effect in addition to the
separate effects of the before measurement and exposure to the film.
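
The arithmetic of the Solomon technique can be sketched in a few lines of
Python. This is only an illustration under stated assumptions: the function
and variable names are hypothetical, and the figures are the hypothetical
percentages of the example above, with 20 percent taken as the baseline for
Control Group 2 because the example's own difference for that group is
computed as 30 - 20.

```python
def change(before, after):
    """Difference between the after and the before percentage."""
    return after - before


def solomon_interaction(d, d_prime, d_double_prime):
    """Method of residues: I = D - (D' + D")."""
    return d - (d_prime + d_double_prime)


# Hypothetical percentages of parents with college plans (before, after).
experimental = (20, 90)   # pretest, film, posttest
control_1 = (20, 30)      # pretest and posttest, no film
control_2 = (20, 30)      # film and posttest; baseline of 20 is assumed

d = change(*experimental)            # pretest + film + interaction
d_prime = change(*control_1)         # pretest alone
d_double_prime = change(*control_2)  # film alone

interaction = solomon_interaction(d, d_prime, d_double_prime)
print("D =", d, " D' =", d_prime, ' D" =', d_double_prime)
print("Interaction I =", interaction)
```

Keeping the three differences as separate quantities mirrors the text: each
one isolates a different combination of influences, and the interaction is
whatever remains of D once the two separate effects are removed.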

Before-After Experiment with No Control Group

We have already discussed before-after designs that involve one or more
control groups. It is also possible to conduct an experiment with no control
group at all. Such an experiment would be schematized as follows.



Stages                                                       Experimental
                                                             Group

1. Selection of subjects: matching and/or randomization
2. Experimental conditions: matching and/or randomization
3. "Before" measurement (pretest)
4. Exposure to test stimuli
5. "After" measurement (posttest)


This design illustrates the fact that control can be achieved in various ways.
The group, as well as the experimental conditions, are "matched to
themselves," i.e., they serve as their own controls. This is accomplished by
comparing a given group's performance at the beginning of the experiment with
its own performance at the close of the experiment. The difference between
the pretest and the posttest is used as an estimate of the influence of the test
stimuli. Such a design may result in accurate conclusions if there is reason to
believe that the external factors, experimenter factors, and internal factors
exert little or no influence on the performance of subjects and that there is no
significant interaction between the pretest and the test stimuli. Whenever such
influences are present, and no control groups are employed to take them into
account, their effects may erroneously be attributed to the test stimuli.


After-Only Experiment with One Control Group
The after-only experiment with one control group is described as follows: 


Stages                                                       Experimental   Control
                                                             Group          Group

1. Selection of subjects: matching and/or randomization      Yes            Yes
2. Experimental conditions: matching and/or randomization    Yes            Yes
3. "Before" measurement (pretest)                            No             No
4. Exposure to test stimuli                                  Yes            No
5. "After" measurement (posttest)                            Yes            Yes


This design achieves control over factors not controlled by the before-after
experiment with no control groups. The existence of a control group
helps the experimenter to avoid attributing the influence of external factors,
experimenter factors, and internal factors to the test stimuli. Furthermore, with
no pretest given to the control group, there can be no interaction between the
pretest and the test stimuli. This method, however, possesses certain
drawbacks. Without a "before" measure, it is difficult to establish the degree of
change in the subjects. In addition, the uncovering and testing of the function
that describes the type of relationship involved becomes even more arduous.
Of course, the before-after experiment provides no guarantees that such
functions will be uncovered and tested. Nevertheless, it at least provides data
describing the degree to which each individual changes, and this kind of
information is needed to test predictions of change on an individual basis.

Randomized-Groups Design

The four experimental designs thus far discussed all involve only one
experimental group. As a result the inferences that can be drawn from their
test-stimuli data have been limited. With respect to the dependent variable
(e.g., change in plans about sending one's children to college) the three
before-after experiments all provide specific quantitative data as to the
magnitude of change. However, these designs, and the fourth as well, only
provide for testing of the relative effects of absence and presence of the test
stimuli. Whereas this measurement incorporates the method of difference
insofar as the independent or test variable is concerned, it does not
incorporate the method of concomitant variations between the independent and
dependent variables. It is thus limited in the data it provides for an analysis
of the degree of relationship. This limitation can be avoided by setting up a
number of experimental groups that vary in magnitude with respect to the
independent variable. One of the most popular of such designs is the
randomized-groups design, which involves the random assignment of subjects to
each of a number of experimental groups as well as to the control group (if
one is used). The randomized-groups design for two experimental groups and one
control group is as follows:


Stages                                  Experimental   Experimental   Control
                                        Group 1        Group 2        Group

1. Selection of subjects: randomization
2. Experimental conditions: matching
   and/or randomization
3. "Before" measurement (pretest)
4. Exposure to test stimuli
5. "After" measurement (posttest)


The test stimuli may be a film of a certain length, and the length of the film
may be varied between the first and second experimental groups. The data may
then be analyzed by comparing the change resulting in each of the
experimental groups with that exhibited in the control group. In this manner
the effects of different degrees of exposure to the film may be compared.
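
The random assignment on which this design rests might be sketched as
follows. This is a minimal illustration, not a prescribed procedure: the
function name assign_randomly, the thirty numbered subjects, and the fixed
seed are all assumptions introduced here for the sake of the example.

```python
import random


def assign_randomly(subjects, group_names, seed=None):
    """Shuffle the subjects and deal them out to the groups in turn."""
    rng = random.Random(seed)
    shuffled = list(subjects)
    rng.shuffle(shuffled)
    groups = {name: [] for name in group_names}
    for position, subject in enumerate(shuffled):
        # Dealing in rotation keeps the groups equal (or nearly equal) in size.
        name = group_names[position % len(group_names)]
        groups[name].append(subject)
    return groups


group_names = ["Experimental Group 1", "Experimental Group 2", "Control Group"]
subjects = list(range(1, 31))  # thirty hypothetical subject numbers
groups = assign_randomly(subjects, group_names, seed=1966)

for name in group_names:
    print(name, sorted(groups[name]))
```

Because chance alone decides which subject lands in which group, differences
among the groups before exposure to the film can be expected to be
unsystematic, which is what allows the later comparison of changes.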





Factorial Design 

Thus far the discussion of experimental design has been limited to the
consideration of only one independent variable in each experiment. The
factorial design has two or more independent variables, and each is varied in
two or more ways. Each combination of the variables is incorporated as a
separate experimental condition. Suppose that one of the independent variables
is the length of the film, with one condition being one hour and the other
one-half hour. A second independent variable might be the major emphasis of the
film as to the significant reason for going to college, with one emphasis being
vocational and the other humanistic. The factorial design for this experiment
is as follows:



Stages                     Experimental    Experimental    Experimental    Experimental
                           Group 1         Group 2         Group 3         Group 4

1. Selection of subjects   Yes             Yes             Yes             Yes
   by matching and/or
   randomization
2. Experimental            Yes             Yes             Yes             Yes
   conditions: matching
   and/or randomization
3. "Before" measurement    Yes             Yes             Yes             Yes
   (pretest)
4. Exposure to test        Yes: one-hour   Yes: one-hour   Yes: half-hour  Yes: half-hour
   stimuli                 film;           film;           film;           film;
                           vocational      humanistic      vocational      humanistic
                           emphasis        emphasis        emphasis        emphasis
5. "After" measurement     Yes             Yes             Yes             Yes
   (posttest)


By taking the average change between pretest and posttest for the first and 
third experimental groups and comparing it with the average change for the 
second and fourth experimental groups, we may assess the relative effective- 
ness of the vocational and the humanistic emphasis. And if we compare the 
average change for the first and second groups with that for the third and 
fourth groups, then we can also assess the relative effectiveness of a one-hour
film and a half-hour film. 
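
The two comparisons just described can be sketched in Python. The mean
changes below are invented solely for illustration, and the names are
hypothetical; the interaction contrast (a difference of differences) is one
common way of expressing interaction in a two-by-two layout and is offered
here as an assumption rather than as the procedure of any particular study.

```python
# Hypothetical average pretest-to-posttest change (percentage points),
# keyed by (film length, emphasis).
mean_change = {
    ("one-hour", "vocational"): 40.0,   # Experimental Group 1
    ("one-hour", "humanistic"): 30.0,   # Experimental Group 2
    ("half-hour", "vocational"): 20.0,  # Experimental Group 3
    ("half-hour", "humanistic"): 20.0,  # Experimental Group 4
}


def level_mean(factor_index, level):
    """Average change over all groups sharing one level of one factor."""
    values = [v for key, v in mean_change.items() if key[factor_index] == level]
    return sum(values) / len(values)


# Groups 1 and 3 versus Groups 2 and 4: vocational versus humanistic emphasis.
emphasis_effect = level_mean(1, "vocational") - level_mean(1, "humanistic")

# Groups 1 and 2 versus Groups 3 and 4: one-hour versus half-hour film.
length_effect = level_mean(0, "one-hour") - level_mean(0, "half-hour")

# Does the advantage of one emphasis depend on the length of the film?
interaction = (
    (mean_change[("one-hour", "vocational")] - mean_change[("one-hour", "humanistic")])
    - (mean_change[("half-hour", "vocational")] - mean_change[("half-hour", "humanistic")])
)

print("Emphasis effect:", emphasis_effect)
print("Length effect:", length_effect)
print("Interaction contrast:", interaction)
```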

One advantage of the factorial design is that the investigator can combine
several different hypotheses within the same experiment and can also test for
interaction between the two independent variables. The concept of interaction
in this context is the same as in the context of the before-after experiment
with two control groups (when interaction between the pretest and the test
stimuli was discussed).

Latin-Square Design

The Latin-square design may be used in situations where it is desirable and
possible to expose each subject to a number of different sets of test stimuli.
In all the designs discussed so far, the subjects were exposed to only one
"treatment" or set of test stimuli. A major advantage of the Latin-square
approach is the economy involved, for the total number of available subjects
is used for each treatment, rather than being divided into smaller groups, with
the increase in variation between the groups resulting from this reduction.
The Latin-square design constitutes a way of controlling the effects on
subjects of the order in which they are exposed to the different experimental
treatments or conditions.

The Latin square itself is a table of r rows and r columns, each of
the cells formed containing a letter. The letters are assigned in such a way
that each letter appears only once in each row and each column. The following
is an example of a 5 X 5 Latin square:


Each of the letters A, B, C, D, and E represents a different experimental
condition, and each row corresponds to a different subject. Thus the square
pictured above has to do with five individuals, each of whom is exposed to
five different experimental conditions. In this way, equal numbers of subjects
are introduced to a given treatment as the first in the series, the second, the
third, the fourth, and the fifth. Thus, the effectiveness of any given treatment
may be calculated by averaging its impact over the five orders of
occurrence.
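
One simple way to build a square with this property is to shift the letters
one place for each successive row. The sketch below assumes this cyclic
construction purely for illustration; it is not necessarily the square
printed in the text, and the function names are hypothetical.

```python
from string import ascii_uppercase


def cyclic_latin_square(r):
    """Build an r x r square by shifting the letters one place per row."""
    letters = ascii_uppercase[:r]
    return [[letters[(row + col) % r] for col in range(r)] for row in range(r)]


def is_latin_square(square):
    """Check that each letter appears exactly once in each row and column."""
    r = len(square)
    symbols = set(square[0])
    if len(symbols) != r:
        return False
    rows_ok = all(len(row) == r and set(row) == symbols for row in square)
    cols_ok = all({square[i][j] for i in range(r)} == symbols for j in range(r))
    return rows_ok and cols_ok


square = cyclic_latin_square(5)
for subject, row in enumerate(square, start=1):
    # Each row is the order in which one subject meets the five conditions.
    print("Subject", subject, ":", " ".join(row))

# Treatments met first by the five subjects: every letter occurs once.
first_treatments = [row[0] for row in square]
print("Valid Latin square:", is_latin_square(square))
```

The column check is what guarantees the balance described above: since every
letter occurs once in each column, each treatment is met first by one
subject, second by another, and so on through the fifth position.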

The Field Experiment 

As in other methods of data collection, there is a continuum in the degree
of closure achieved or the degree to which variables other than those being
investigated are controlled in the experiment. The terms laboratory
experiment and field experiment are used to designate the two ends of this
continuum. Both approaches represent attempts to understand phenomena, but
the strategies differ.




Because the laboratory experiment often has a high degree of control, the
stage is set for the objective verification of whatever ideas are under
investigation. If these ideas are fruitful, then a great deal of understanding
can be achieved which can then be applied to the investigation of more complex
situations which are closer to the field-experiment end of the continuum. The
field experiment tends to incorporate fewer controls and to achieve less
closure; as a result, objective verification suffers. Like other less-structured
techniques, however, it tends to be highly suggestive within the context of
discovery.

Holmberg's "participant intervention" in the hacienda Vicos, located in
Peru and rented by Cornell University, provides an interesting example of a
field experiment.* Over a period of years a wide variety of innovations were
introduced which resulted in the transformation of a traditional hacienda
system into a new system of community organization based on shared
interests and local control. Significant changes also occurred in many other
areas, such as income, education, production, and health facilities.

One important advantage of this research-and-development approach to the
experiment is the immediate and comprehensive feedback of information on
the effectiveness of interventions. This does not have the same significance in
the context of justification as does the feedback resulting from an analysis of
data and a testing of hypotheses in a laboratory experiment. But the
opportunities afforded within the context of discovery for the reformulation and
continual development and improvement of ideas are great. If one of the
functions of research is to point the investigator toward the next research
questions to be asked, then participant intervention seems to offer some
unusual advantages in this area.

Of course, it is only to the extent that controls can be developed within the
field situation that relatively definite answers to these questions can be
obtained. In such situations all the sources of error previously discussed
(internal, external, and experimenter factors, as well as various kinds of
interactions) tend to be magnified. If one is dealing with clusters of variables
at a time, and if the unit is one culture or society rather than a number of
subjects, it becomes difficult to provide objective evidence for hypotheses about
the relationships among variables.


SELECTED REFERENCES FOR FURTHER STUDY 

Roger Brown et al., New Directions in Psychology (New York: Holt, Rinehart
and Winston, 1962). The emphasis here is on content rather than method, but
in the process a wide variety of significant experiments is presented. The content
areas covered are attitude change, psychophysics, ethology, and emotion.

* Allan R. Holmberg, "The Research-and-Development Approach to Change:
Participant Intervention in the Field," in Richard N. Adams and Jack J. Preiss
(eds.), Human Organization Research (Homewood, Ill.: The Dorsey Press, 1960),
pp. 76-89.

Ernest Greenwood, Experimental Sociology: A Study in Method (New York:
King's Crown Press, 1945). Although this volume does not incorporate the most
recent ideas on the nature of the experiment, it is valuable for its thorough
treatment of the experiment as it relates to a sociological context. In addition to
a thorough development of a typology for experiments, many illustrations from
the sociological literature are provided.

William S. Ray, An Introduction to Experimental Design (New York: The
Macmillan Company, 1960). This book starts with first principles and gradually
moves to a consideration of the complexities of experimental design and analysis.
Some previous background in statistical inference is probably necessary, although
the book does provide a review of this subject.





Interviews, Questionnaires,
and Surveys


6.1 EXPERIMENTS AND SURVEYS

The concept of survey in ordinary usage often carries the connotation of
public-opinion polls on political views or consumer preferences. In the
context of scientific research, the survey constitutes a method of data
collection that utilizes interview or questionnaire techniques for recording the
verbal behavior of respondents. It is limited neither to opinions nor to
the sheer description of percentages of individuals who have given
characteristics. Instead, it constitutes an effective tool for getting at
cause-and-effect relationships.

One important distinction between the survey and the experiment is that
the former does not center around an analysis of the effects of test
stimuli on the individual but focuses on the reconstruction of processes that
occurred prior to the investigation. This difference gives survey research
both potential advantages and disadvantages. On the positive side, the
investigator can cover a wide range of phenomena and is not limited to
analyzing the effects of a few test stimuli; the potential richness of survey
data is often illustrated by the contributions of investigators who have
analyzed data that have been collected by others for completely different
purposes. Thus, survey data can be used to put together a great deal of
information about an individual's values, expectations, and social relationships
in order to investigate his behavior. Another advantage of the method is the
relative ease with which the investigator can obtain the cooperation of a
probability sample of individuals from a population.

The experiment is almost invariably conceived of as a laboratory
experiment, but it is entirely feasible to utilize interviewers as
"experimenters" to perform their experiments in the context of the interview
situation. This would involve a randomization procedure that would result in
some respondents being presented with some stimuli and other respondents being
exposed to different experiences. The major limitation here has to do with the
type of stimuli that could be effectively presented within an interview situation.

An important disadvantage of the survey technique is its lesser degree of
control over the data-collection situation and the resultant greater possibility
for factors that are unknown to the investigator to interfere with the results.
One aspect of control within the experiment has to do with the time sequence
involved in the "before" measure, the introduction of test stimuli, and the
"after" measure. Because it is the experimenter himself who introduces or
assigns these factors, he has no difficulty in inferring whether a change in
attitude came before or after the test stimuli. Another important aspect of
experimental control has to do with the processes of randomization and the
existence of one or more control groups. The survey researcher cannot
similarly deal with internal, experimenter, and external sources of error,
although he does have some useful techniques for handling them. Thus, he can
make his statements about relationships among variables with a lesser degree of
confidence. With his lesser degree of control, it is more likely that the
relationships he uncovers will be happenstance.

One argument often used against the survey method is that verbal behavior
is quite unreliable, for individuals tend to say what they think is socially
acceptable. This is a real problem for survey research, but it does not seem to
be insuperable. The same kind of problem also occurs in the context of the
experiment. If individuals can give socially acceptable verbal responses, they
can also attempt to act in accordance with what they believe to be the
experimenter's expectations. Another argument against the survey is that the
individual's responses about his own behavior are limited by his insight. This is
true, but scientists in any field usually do not expect easy answers to difficult
questions. Act meanings (i.e., the individual's own definition of the situation)
are in themselves important. It is too much to expect to obtain action
meanings (i.e., scientific explanations) from the respondent himself. Also,
non-verbal behavior does not automatically provide us with action meanings.

An argument often used by sociologists against the experiment is that it is
conducted in "artificial" laboratory situations, and that consequently the
results obtained cannot be generalized to more complex situations. The
implication is that the survey researcher deals more directly with "real" behavior
by conducting his investigations in a more natural setting. One disadvantage
to this as a general argument against the experiment is that there are such
things as field experiments, and that these can be performed within the
context of the interview situation. It is, however, such prejudicial attitudes
against the experiment that prevent the further development of the field
experiment. The argument also rests on a rather naive assumption that "reality"
is relatively close to the surface and is to be apprehended by avoiding
abstraction and dealing with phenomena as they occur in natural settings. A
reverse argument may be equally well defended, namely, that so-called artificial
laboratory situations are in fact closer to reality than natural settings, for the
higher level of abstraction involved aids in obtaining closure and thus eliminates
many extraneous factors that would otherwise obscure the situation.

The point here is not that one method is generally superior to the other,
but that each has certain strengths and certain weaknesses, and much can
be done to repair the weaknesses and extend the strengths of both. For
example, references have been made to the possibilities of incorporating a fuller
range of data about the subjects within the experiment as well as
experimenting within the interview situation.

The data emerging from the experimental designs presented in Chapter 5
are generally analyzed according to groups rather than individuals. Thus, the
average change in attitude for the experimental group is compared with that
for the control group, or the change in one experimental group is compared
with that for others. Within these research contexts, only a small amount of
information about the characteristics of each individual is brought to bear
on an analysis of his behavior within the experiment. Such information can be
quite useful if it can be theoretically organized. In fact, if it is true that
the behavior of the individual is a function of his own goals and expectations,
then it may be necessary to obtain this kind of information from the
individuals involved in the experiment in order to understand what is occurring.

Some psychologists might have reservations about experiments conducted
outside a laboratory situation. The two crucial factors involved in the
experiment, however, seem to be the high degree of control involved and the focus
of the experiment on the effects of stimuli introduced by the experimenter.
Both factors can be produced within the context of the interview. It is easy
enough to randomize respondents so that equated groups can be
subjected to different kinds of interview experiences. The crucial
question is whether the interviewer can present stimuli that can
serve to be the focus of the investigation. If this can be done, other
problems, such as that of controlling the uniformity of the stimuli
presented by interviewers, would not generally serve to rule out this
approach.

6.2 INTERVIEWER AND RESPONDENT 

In the interview, as well as in the experiment, it is important to be
aware of the interaction between researchers and subjects. It is difficult to
predict the effects of any type of interaction unless we have knowledge
about the values and expectations of the participants. When such information
is available, it can be used to interpret and assess the information obtained
in the interview. It is also possible to use the interview situation itself as a
source of data, for an understanding of the dynamics involved may lead toward
an understanding of social interaction as a whole.

At the present stage of social science, we know little about the interview
and questionnaire situations. Some individuals feel that the degree of
knowledge needed to effectively control what goes on in these situations will
never become available, and that questionnaire construction as well as
interview technique will always be something of an art. More optimistic social
scientists point toward the progress already made and look forward to further
revealing studies on the interviewer-respondent situation.

Standardized, Semistandardized, and Unstandardized Interviews 

Any consideration of the interview situation must take into account the
different types of interviews. These may be classified into three broad groups:
the standardized interview, the unstandardized interview, and the
semistandardized interview. In the standardized interview, the interviewer is held
to the specific wording in the interview question schedule; he is not free to adapt
his questions to the specific situation, to change the order of topics, or to
ask other questions. In the unstandardized interview, the interviewer is free to
develop each situation in whatever ways he deems most appropriate for the
purposes at hand. In the semistandardized interview, the interviewer may
have to ask a number of specific major questions, but he may be free to probe
beyond the answers to these questions.

One way of understanding the relative utilities of each type of interview 
is to examine its contributions in the context of justification as compared to 
the context of discovery. The standardized interview seems to have certain 
advantages in the context of justification that arise from the relative uniformity 
in the behavior of the interviewer and the resulting ease with which other 
investigators can duplicate these interview situations. The argument for this 
approach is that it is necessary to know what stimuli produced the verbal 
behavior recorded if one is to develop an understanding of relationships 
among phenomena that can then be replicated by other scientists. 

The unstandardized interview seems to have advantages within the context
of discovery. A skilled interviewer who is quite familiar with the broad
purposes of the study may be stimulated by the answers of the respondent to
develop new ideas about the phenomena under investigation. If he is not
confined by the limits of a standardized interview, he will be able to explore
any such ideas fully and thus go considerably beyond the original formulation
of the problem. In addition, he might find it necessary to change his general
line of questioning if important results are not forthcoming. In the
unstandardized interview, the interviewer can be quite flexible in adapting his
approach to whatever appears to be most fruitful for a given respondent. For
example, a standardized question might not be fully understood by respondents
from different social classes, and the interviewer can strive to alter the wording
of the questions so as to achieve some equivalence in meaning for different kinds
of respondents.

Of course, each type of interview can contribute both within the context of
justification and within the context of discovery. New ideas may emerge from
an analysis of standardized interviews. And it is also possible to perform
quantitative analyses of the materials from unstandardized interviews and to
test various kinds of hypotheses.

The Focused Interview 

Merton and Kendall have developed what they call a focused interview,
which is unstandardized in the sense that the wording of questions is not
specified.¹ There is, however, a definite focus on the effects of a given
phenomenon experienced by the respondent. The interviewer comes to the
interview situation with a list of topics based on his previous analysis of the
problem and understanding of the phenomenon under investigation. His
knowledge of the content of the phenomenon enables him to distinguish
between the stimuli to which the respondent was exposed and those which
he actually received. It also allows him to direct his questions so as to
discover the kinds of perceptual distortions which resulted, as well as some
of the factors which influenced degree of distortion.

The Non-Directive Interview 

The non-directive interview is also unstandardized, but here the interviewer
avoids channeling the interview in certain directions. Instead, he attempts to
develop a very permissive atmosphere in which the respondent will feel
perfectly free to express his feelings without fear of disapproval. Even
the highly skilled non-directive interviewer may influence the respondent in
various ways, but the purpose of the non-directive interview is to minimize
this influence.

Social Desirability 

The various types of interviews differ from one another in many ways. A
common characteristic, however, is the influence of the interviewer and the
interview situation on the respondent. If we assume that the respondent's
behavior is a product of his values and expectations, then it follows that his
expectations will vary with the interview situation.

Edwards has studied what he calls the social-desirability variable as it
affects responses to questionnaires.* Personality trait items were rated by
respondents who were asked "to judge the traits in terms of whether you
consider them desirable or undesirable in others." A different group of
individuals were then asked to indicate whether the items were characteristic of
themselves. A close relationship was found between the social desirability of a
given item and the probability of its appearance among the self-ratings.

If the findings in the Edwards study hold for interview situations as well,
then much of what the interviewee says may be strongly influenced by his
conception of the social desirability of saying it. The degree to which this occurs
should vary from one interviewee to the next, and from one topic to the next.
This variation would be related to the importance of this goal of appearing in a
socially desirable light on the part of a given interviewee on a given topic.

Ability and Willingness of Respondents

There is always the possibility, of course, that those personality traits that
are most socially desirable are also the ones that are most widely distributed.
If, however, this is assumed not to be the case, there seem to be two possible
explanations. First, it may be that respondents see themselves differently from
the way others see them, in which case they are not deliberately exaggerating
or distorting the truth, but, rather, lack insight into implications of their own
behavior as it appears to others. Or it may be that respondents try to present
a more favorable image of themselves, in which case they are consciously
attempting to exaggerate or distort the truth.

Although Edwards’ study focused on questionnaire situations, it also has 
implications for interview studies. Any investigator must ponder a general
question: How is one to take into account respondents’ unknowing perceptual 
or conceptual distortions as well as their conscious attempts at exaggeration, 
distortion, or falsehood? These are the two difficulties most often cited in 
discussions of the limitations of the interview technique. Although it has been 
suggested that the investigator focus a portion of the research effort on an 
assessment of the degree to which these factors operate, there remains the 
problem of identifying and developing those interview techniques that will 
help to minimize their effects. 

The focused interview seems to possess particular advantages in revealing
conceptual distortion and lack of insight. Because knowledge of the objective
situation under investigation is available to the interviewer, he can detect not
only the nature of the distortion but also its degree and its possible sources.
The non-directive interview is designed to produce the permissive atmosphere
that will minimize any motive on the part of the respondent deliberately to
distort the truth. It is difficult, however, to determine how successful the
technique is for a given respondent unless specific means are devised to check
for truth or falsity. On this point, research on the interview situation itself
can provide important information. To what degree, for example, is a given
answer on the part of the respondent socially desirable in terms of his own
value system? To what degree does the respondent bring other values to the
interview situation which have to do, for example, with the importance of his
telling the truth for the advancement of scientific knowledge?

The ability of the respondents to present an objective picture of various
phenomena (and, also important, their willingness to do so) is significant
for interview situations in general. An individual who is not aware that his
statements represent a distortion of phenomena as seen through the eyes of
others is providing valuable data on his own definition of the situation, for
unless we understand the respondent's way of looking at things, we will not
be able to understand his behavior. A view of the phenomena through the eyes
of respondents will provide the investigator with data that can be used, as in
the focused interview, to identify the mechanism producing this distortion.


Interview Technique 

Conscious or deliberate distortion is another matter entirely, and a great
deal of interviewing technique centers on creating the kind of atmosphere
that will minimize the respondent's need to distort. Interviewers are

instructed to be very careful about expressing approva cMtures 

statements made by the respondent, and to reaction, 

and verbal responses when the respondent indicates a interview but 

These techniques are a cardinal principle of the non 1 Keve'rthe- 

less, the interviewer must tread a fine line, n ne is ' . .. v jqo 

respondent may look to him tor still be so minimal that 

reserved, his degree of interaction with the 

the trspondeni may not be motivate to interviewer not to try to 

One possible solution to this dilemma is for the interviewer not to try to
minimize his interaction with the respondent. Interviewer behavior
constitutes one of the most important rewards for the respondent in the
interview situation. At the same time, the interviewer should maintain
a permissive atmosphere and avoid any suggestive gestures or comments,
e.g., a look of surprise in reaction to an answer, or a "yes" or "I
agree" spoken after the respondent gives an opinion. The tendency of
respondents to distort the truth may also be reduced if the interviewer shows by
word and manner that he takes the interview very seriously and that the
information being obtained from the respondent is vitally important to the study
and, perhaps, to the progress of science. This technique may serve to
counterbalance whatever reasons exist for distortions.


The Initial Contact

These techniques for minimizing [...] during the initial contact between
interviewer and respondent, when the immediate task of the interviewer is to
[...] the cooperation of the respondent in undertaking the interview. Novice
interviewers sometimes feel uncomfortable



INTERVIEWS, QUESTIONNAIRES, AND SURVEYS

[...] technique, however, is too inflexible [...] the variations in each [...]
individuals have the ability to make memorized material sound spontaneous. As
in the interview situation itself, the task of the [...] respondent is to
convey the importance of the [...] some of the more immediate rewards that
would result from cooperation. These rewards involve the respondent’s interest
in being [...] manner and personality of the [...] individuals simply do not
care very much with whom they talk as long as they get a chance to talk, but
most people would rather talk with someone who is personable and interesting.

Another important technique that [...] effective in the contact situation
involves the structuring of the request [...] interview. For example, if the
interviewer asks, “Do you have any [...] interview you?” he will more likely
receive a “No” than if [...] possibly help us out in this research project,
[...] questions now or at another time?” The first approach [...] respondent
is being asked to give up something, namely, [...]. [...] respondent’s
attention is focused on this value when [...] decision, he may tend to
overlook the rewards involved in the interview. The second approach intimates
that positive values will [...] interview to take place: progress in
scientific research and the personal appreciation of the interviewer. The two
approaches differ in another respect. [...] emphasizes the [...] decision as
to whether [...] interview to take place at all, whereas the second implies
that the [...] time when he is to allow the [...] questions. Some interviewers
have been highly successful in obtaining [...] a choice between the
alternatives of [...] week.” By setting up different kinds of choices, the
different approaches also structure the ease with which the respondent may
decline to [...]. Respondents seem to find it more difficult to [...] to ask
them questions than to decide that they do not have time to be interviewed.
Indeed the plea of “lack of time” constitutes a convenient [...] for
terminating the contact gracefully.

The Recording of Data 

Thus far the discussion has emphasized the ways in which the interviewer
affects the respondent. Interaction, however, implies a [...], and therefore
it is important to consider the impact of the respondent on the interviewer.
This impact has already been referred to in [...] advantages of the
non-standardized interview, in which the interviewer is given considerable
latitude to design questions on the basis of the respondent’s statements. One
of the problems involved in securing accurate data in any interview situation
has to do with whether or not the interviewer actually perceives and records
the statements made by the respondent. For example, the interviewer may
develop initial expectations about respondents’ behavior on the basis of their
answers to the preliminary questions, as well as through observation. This
“set” can then operate to prevent the interviewer from correctly perceiving an
answer that conflicts with these expectations. A technique that can
effectively eliminate this difficulty is the mechanical recording of the
interview. Such an approach serves to prevent not only interviewer distortion
but also substantial loss in the amount of the interview content.

The question as to how data might best be recorded by the interviewer
deserves attention. Mechanical recording can have a good deal of importance
because it reproduces the emphasis, intonation, pitch, and pauses of the
respondent as well as the entire content of the interview, including the
stimuli introduced by the interviewer. If it is true that the respondent’s
behavior is in large measure a product of the particular interview [...],
then a mechanical recording can help to provide an accurate understanding
of this situation. Furthermore, the detail such a recording furnishes [...]
respondent has great potential value in determining how the respondent
defines the situation. It may be argued that note-taking is [...] procedure,
but this may merely result from the fact that very [...] ever been based on
mechanical recording. The impact of this [...] on the spontaneity and veracity
of the respondent [...] determined by research. Experience has already [...]
recording equipment can result in very few refusals. The technique of
introducing such equipment would, of course, affect the refusal rate. [...]
situation might be appropriately structured using the [...] initial contact
between interviewer and respondent. [...] might be asked to suggest the best
place for the recording equipment rather than be asked whether a recording may
be made. Ethical considerations [...] definitely involved in the researcher’s
choice of strategy. Is the [...] achieved through such structuring
sufficiently great to offset whatever [...] might result from the manipulation
involved?

6.3 SURVEY ADMINISTRATION

Interviewing is often thought of as taking place [...] and well-financed
surveys with large numbers of interviewers and research analysts. This is not
necessarily the case. The interview [...] be found within the context of
experiments. In that situation, there may be one interviewer and one
respondent, or there may be many of each. The respondents selected may or may
not represent a probability sample of [...] population. The data may be
analyzed by electronic computers or by hand. The choices made depend on [...]
than [...] possess no special advantages over [...] arbitrary number of
interviews [...] approach to sampling becomes necessary when the survey [...].
This attitude militates against the utilization of [...] non-probability
survey when, in fact, it might be the most appropriate tool for a variety of
research situations.

The large survey, especially when it is [...], possesses important advantages
within the [...] rise to various difficulties. Not the least [...]
difficulties is caused by the many administrative problems [...] organizations
of different types that specialize in [...] and retain a staff of interviewers
and analysts, most [...] performed on an ad hoc basis. It is difficult to
[...] large study; to hire, train, and supervise a large number [...]; to
supervise the processing and analysis of data; to deal with staff [...] aware
of the short-term nature of their positions; [...] activities within a
specific time schedule. Therefore, it is not [...] that those researchers who
are responsible for directing a large survey [...] more time on administrative
duties than on activities more closely related to their technical competence.

The selection, training, and supervision of interviewers is a crucial aspect
of survey research. The difference between the [...] poorly motivated and
badly trained interviewers and that of a [...] and well-trained staff affects
the quality of the data obtained. [...] and intelligence on the part of the
interviewer [...] establishing personal contact (in order to gain the
cooperation [...] respondent) and in maintaining this kind of contact during
the [...] respondent motivated to provide a [...] interview). [...] and
adaptability are important during the training process. One of the [...]
important aspects of this process is the effective communication [...] nature
and importance of the study, for the interviewer in turn must be capable of
communicating this information to potential respondents [...] cooperation.
Because many of the problems interviewers experience occur during their first
few interviews, it would seem advisable [...] training sessions while
interviewing is in progress. Careful analysis of [...] will provide feedback
information and reveal any tendency on the part of interviewers to make up
data out of whole cloth. (Such cheating is quite possible in a poorly
supervised study, and a given interviewer may [...] proceed to “interview” as
many as thirty or forty “respondents” in this [...].) One effective
supervisory procedure is to inform interviewers that [...] will be
periodically checked through contact with the respondents, and then to perform
such checks by telephone, by letter, or in person.
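
The periodic check on interviewers can be illustrated with a short Python
sketch. It is only a sketch under stated assumptions: the record layout
(pairs of interviewer and respondent identifiers), the function name
select_for_verification, and the quota of two checks per interviewer are all
hypothetical and are not taken from the text.

```python
import random
from collections import defaultdict


def select_for_verification(completed, per_interviewer=2, seed=None):
    """Pick a random subset of each interviewer's respondents to re-contact.

    `completed` is a list of (interviewer_id, respondent_id) pairs.
    The layout and the per-interviewer quota are hypothetical choices.
    """
    rng = random.Random(seed)

    # Group the completed interviews by interviewer.
    by_interviewer = defaultdict(list)
    for interviewer_id, respondent_id in completed:
        by_interviewer[interviewer_id].append(respondent_id)

    # Draw up to `per_interviewer` respondents for each interviewer.
    checks = {}
    for interviewer_id, respondents in by_interviewer.items():
        k = min(per_interviewer, len(respondents))
        checks[interviewer_id] = sorted(rng.sample(respondents, k))
    return checks
```

Drawing the respondents at random, rather than checking the first few, means
an interviewer cannot predict which interviews will be verified.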

6.4 CONSTRUCTION OF QUESTIONNAIRES AND 
INTERVIEW SCHEDULES 

Open and Fixed-Alternative Questions 

We may broadly distinguish between open or open-ended questions and 
closed or fixed-alternative questions. The open or open-ended question does 
not provide a list of alternative answers — e.g., “Why did you choose medicine 
as a career?” “What do you like most about surgery?” “How do you feel 
about America’s system of higher education?” “How would you react if you
were proctoring an examination and saw a close friend of yours cheating?” 
The closed or fixed-alternatives question, on the other hand, limits the respond- 
ent to a choice among specific alternatives; e.g., 

What importance would an opportunity for developing warm personal relationships
with patients have for you in your ideal job?

Indispensable ___  extremely important ___  very important ___  fairly important ___
little or no importance (or would rather not have) ___

Each type of question has its advantages. The open question, because it
puts very few words in the mouth of the respondent, is more effective in 
revealing his own definition of the situation, whatever it is. The closed ques- 
tion is, in this respect, analogous to the “leading questions” which are not 
permissible in most aspects of courtroom procedure. If the respondent does 
not understand the question or if he grossly misunderstands, this too will be 
revealed in his answers to open questions. In addition, the phrasing of the 
open question is closer to that used in ordinary conversation and, as a result, 
may encourage spontaneity on the part of the respondent as well as reinforce 
his motivation to communicate effectively and thoroughly. The respondent 
may react negatively to fixed-alternative questions because he is not able to 
give highly individualized answers; he may even feel that such procedures are 
inaccurate and thus come to question the value of the interview. 

On the other hand, the fixed-alternative or closed question produces greater 
uniformity among respondents along the specific dimensions in which the in- 
vestigator is interested. Consequently, it aids greatly within the context of 
justification in that the investigator is assured that he will be able to obtain 
relatively complete information from his entire sample about the specific phe- 
nomena with which he is concerned. Many answers to open questions are 
not useful in testing specific hypotheses because they constitute responses 
along many different dimensions. For example, one student might answer the 
question about why he chose medicine as a career by referring to his own 
goals; another, by referring to his expectations about the favorable
characteristics of the field; another, by [...] other fields; and still
another, by [...] who have influenced him in this direction. Although this
variety is quite valuable within the context of discovery, it [...] specific
hypotheses difficult. There is also a practical consideration [...] the use of
closed questions: the costs and time involved in [...] than those in the case
of open [...] for the researcher is to create the best possible instrument for
data collection while taking into account his limitations with respect to
resources.

The “Why?” Question

The “Why?” question, a type of open question, [...] particular attention on
the part of [...], and a suggested procedure for asking and analyzing such
questions is called reason analysis.⁴ The approach involves a careful [...]
pretesting or preliminary interviews, of the range of possible [...] given in
response to a particular “Why?” question. For example, the investigator might
isolate four different kinds of reasons for choosing [...]. During the actual
interviewing, he might follow the [...] “Why?” question by a series of
questions designed to explore whichever of these dimensions are omitted by the
respondent. In addition, the interviewer [...] describe the choices through
time and differentiate [...] ones. Finally, he would attempt to put together
all the pieces of information in an outline of the dynamic relationships among
all the [...].

Reason analysis provides an illustration of the “funnel” technique in
questionnaire and [...]. [...] studies include a range of questions, some
open, some closed, [...] the investigator must decide on an ordering of the
questions. [...] technique specifies initial questions which are open and
subsequent questions which are more specific and are generally of the closed
variety. [...] analysis may begin with the general “Why?” question and then
proceed to [...] specific aspects of the choice process. The advantage of this
technique is that it permits an analysis of the respondent’s spontaneous frame
of [...] while assuring that data will be collected from each respondent on
[...] desired topics.
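
The probing step of reason analysis (following the general “Why?” question
with questions on whichever dimensions the respondent omitted) can be sketched
in Python. This is an illustrative sketch only: the dimension names and the
wording of the probes are hypothetical, loosely modeled on the four kinds of
answers to the career question described earlier, and the judgment of which
dimensions an open answer has covered is assumed to be made by the
interviewer.

```python
def follow_up_probes(dimensions, mentioned):
    """Return the probe questions for dimensions the respondent left out.

    `dimensions` maps a dimension name to its follow-up question; the
    dictionary order is the order in which probes are asked. `mentioned`
    is the set of dimensions the interviewer judged the open answer to
    have covered.
    """
    return [question for name, question in dimensions.items()
            if name not in mentioned]


# Hypothetical dimensions for "Why did you choose medicine as a career?"
CAREER_DIMENSIONS = {
    "own_goals": "What did you hope to achieve through this career?",
    "field_characteristics": "What features of medicine appealed to you?",
    "other_fields": "What did you think of the other fields you considered?",
    "influential_people": "Did anyone influence you in this direction?",
}
```

Because the open question is asked first and the probes only afterward, the
respondent’s spontaneous answer is recorded before any dimension is suggested
to him.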

Some Tactics

In the construction of questionnaires and interview schedules it is
important that the verbal stimuli presented by the interviewer [...] be as
clear as possible. There is always a degree of vagueness attached to language,
but it is important that the researcher try to minimize vagueness and
ambiguity. Unless this is done, respondents are, in effect, being presented with
varying and uncontrolled stimuli and the significance of their responses would
consequently be difficult to assess. The investigator can promote clarity by
avoiding long questions and words with double meanings; specifying the time,
place, and context the respondent is being asked to assume; prefacing
unfamiliar or unusually complicated questions with an explanatory paragraph
or an illustration; and asking questions in terms of the respondent’s own
immediate and recent experience rather than in generalities.⁵

⁴ The initiation of these ideas is credited to Paul F. Lazarsfeld. [...]
Hans Zeisel, Say It With Figures (New York: Harper and Row, Publishers,
pp. 131-74, 1957).

Although it is very important for the interviewer to create the kind of at- 
mosphere that motivates the respondent to be as candid as possible, it is also 
important for him to phrase his questions in such a way so as to encourage 
veracity. This is of special importance when an interview deals with sensitive 
topics — i.e., where a given answer has a very high or very low degree of so- 
cial desirability. Among the guidelines developed for interviewers are the 
following: 

1. Indicate that other people have what might ordinarily be considered to 
be a socially undesirable attitude or characteristic; e.g., “Most people 
have thought about suicide at one time or another.” 

2. Attempt to achieve some balance of social desirability among choices; 
e.g., “Some political leaders believe . . . while other political leaders 
believe. . . .” 

3. Structure the question in such a way that the respondent is assumed to
possess the socially undesirable characteristic, thus placing on him the
burden of denial. (For example, “How much are your monthly pay-
ments?” might be preferable to “Have you purchased anything on the
installment plan during the last two years?”)

4. Substitute euphemisms for more value-loaded language. (For example,
training methods might be superior to methods of discipline in discussing
child-rearing.) And, in general, try to avoid words which convey a high
degree of social desirability or undesirability.

5. If it is desired that the respondent express criticism of something, first 
provide him with an opportunity for voicing praise so that he will not 
feel that he is being unfair. 

6. Structure the question in such a way that the respondent will be able to
admit gracefully that he is not familiar with something or that he has not
yet formed a particular expectation. (For example, “Do you have any
feelings about how X should behave?” might be preferable to “How do
you feel that X should behave?”) The funnel technique provides an ad-
ditional mechanism for achieving this effect.⁶

⁵ These suggestions are taken from Eleanor E. Maccoby and Nathan Maccoby, “The
Interview: A Tool of Social Science,” in Gardner Lindzey (ed.), Handbook of Social
Psychology, Vol. 1 (Cambridge, Mass.: Addison-Wesley Publishing Co., p. 456, 1954).

⁶ This list is based on ideas presented in Maccoby and Maccoby, op. cit., p. 457.



Question Order

Within the questionnaire or [...] emits a series of stimuli, many of which
are in the form of questions of one type or another. The guidelines used for
determining question order are based on the possible effects a given stimulus
might have on [...]. Some of the rules of thumb in general use are:

1. On a given topic, general questions should usually precede specific
questions. This advice is in accord with the funnel technique, in which
initial questions are open and subsequent questions are closed. If, for
example, a respondent were queried on whether A constituted one of the
reasons for his political preference prior to being asked what his
reasons are, he might be tempted to include A among his reasons in
response to the more general question.

2. The entire sequence of questions should follow some logical order, so
that the respondent is not called upon to make abrupt transitions and
so that the sequence aids him in answering the questions. A common
logical order is the time sequence, the respondent being asked about the
past, the present, and the future in that order. Another procedure is to
move from the more specific or familiar to the more abstract or less
familiar.

3. Some questions are of such a nature that they might exert an important
effect on all subsequent questions. For a given study, questions
about income or religion might so antagonize the respondent that the
remainder of his responses would be greatly affected. Sometimes a
question that reveals the guiding purpose of the study can affect
subsequent answers. An example of this is provided by the medical student
study. Care was taken to avoid asking more questions about public health
than about other medical fields until the very last part of the
questionnaire. This procedure of deferring such questions for as long as
possible is one generally adopted.


Questionnaire and Interview Compared

The most obvious practical difference between the questionnaire and the
interview survey is cost. Within the medical student study this was the most
important consideration governing the choice of a questionnaire instrument
over the interview schedule, although other factors were involved. A related
factor was the considerable time required to obtain a very large number of
interviews. It would take a small staff years to interview thousands of
individuals. If more immediate results are desired, then a special staff of
inter-



obvious to the respondent, but they depend less on the assumption of pro-
jection.

Another type of projective technique has to do with asking the respondent
to play one or more roles. This technique is called psychodrama if he plays 
himself, sociodrama if he acts out the roles of others. Here the respondent 
is removed from the paper-and-pencil situation, and has relatively little time 
to prepare an answer he thinks would have social desirability. If the inter- 
viewer himself adopts a role which complements that adopted by the re- 
spondent, he has the opportunity of subjecting the respondent to sequences 
of stimuli and to investigating the dynamics of his behavior. 

One of the difficulties involved in the use of projective techniques is the
problem of developing reliable methods for analyzing the data produced. In-
direct methods, which are more structured and which consequently tend to
have a higher degree of reliability, have been reviewed by Campbell.⁷ In the
error-choice method,⁸ for example, respondents are asked to choose among
different possible answers to a factual question (such as “What is the salary
of the average psychiatrist?”). All the answers provided are incorrect, and the
direction of the error a respondent makes is used as a measure of his values,
attitudes, or expectations. Another relatively structured indirect test is the
information test, which is based on the premise that the amount and kind
of information an individual possesses about a given subject are related to
his values and expectations. Consequently, information questions may be
utilized as indirect measures of these dimensions.⁹
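
The scoring idea behind the error-choice method can be sketched in Python.
This is a minimal sketch under assumptions that are not in the text: each item
is taken to have a numeric true value, the investigator is assumed to have
decided in advance which direction of error counts as favorable, and the score
is a simple sum of signed directions. The function name error_choice_score is
hypothetical.

```python
def error_choice_score(items, answers):
    """Sum the signed direction of each respondent error.

    Each item is (true_value, favorable_direction), where
    favorable_direction is +1 if overestimating is taken to indicate a
    favorable attitude and -1 if underestimating is. `answers` holds the
    numeric alternative the respondent chose for each item. None of the
    offered alternatives may equal the true value.
    """
    score = 0
    for (true_value, favorable_direction), chosen in zip(items, answers):
        if chosen == true_value:
            raise ValueError("error-choice alternatives must all be incorrect")
        # +1 for an overestimate, -1 for an underestimate.
        direction = 1 if chosen > true_value else -1
        score += direction * favorable_direction
    return score
```

A respondent whose errors consistently fall in the favorable direction
receives a high positive score, and one whose errors fall in the opposite
direction receives a negative score; errors in mixed directions cancel.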

Questions have been raised as to the validity of the various indirect tests— 
i.e., the degree to which they actually succeed in measuring those character- 
istics of the individual that are not easily measurable by more direct methods. 
Campbell states that the literature is almost devoid of evidence indicating that 
indirect questions provide more valid data than information produced by 
direct questions.¹⁰ Knowledge of the interview situation, however, seems to
indicate that the respondent's lack of insight or his interest in going along with 
what is socially desirable can seriously jeopardize the quality of interview 
data. Thus, although indirect tests generally have not yet proved their reli- 
ability and validity, the issue is by no means closed. Furthermore, there is 
no reason why reliable and valid indirect techniques cannot be developed. 
(Criteria used for establishing reliability and validity will be discussed in 
Chapter 10.) 

⁷ D. T. Campbell, “The Indirect Assessment of Social Attitudes,” Psychological
Bulletin, XXXXVII: 15-38 (1950).

⁸ K. R. Hammond, “Measuring Attitudes by Error-Choice: An Indirect Method,”
Journal of Abnormal and Social Psychology, XXXXIII: 38-48 (1948).

⁹ This indirect method is discussed by T. M. Newcomb, “The Influence of Attitude
Climate Upon Some Determinants of Information,” Journal of Abnormal and Social
Psychology, XXXXI: 291-302 (1946).

¹⁰ Campbell, op. cit., 30.



situation that will motivate the [...]. [...] purposes such motivation may be
insufficient to [...]. This may occur, for example, in a situation [...]
questioned about aspects of his own behavior that [...] considered highly
undesirable. By employing an indirect method, [...] respondent is no longer
motivated to conceal information because [...] aware that it reveals behavior
he considers to be socially undesirable. [...] an attempt to penetrate the
[...] world of the respondent, to increase his ability [...] data (as distinct
from his willingness to do so). Implied in such [...] assumption that factors
beyond the awareness of individuals may [...] important effects on their
behavior. These factors may include values or goals [...] the individual’s own
view of himself.

Projective techniques are data-collection methods that involve the
presentation of relatively unstructured stimuli [...] respondent. It is
assumed that, because the stimuli are unstructured, [...] be required to
organize or structure them and will, [...] important aspects of his own
personality and behavior. Although [...] techniques were originally utilized
in clinical [...] more general social science research. The Rorschach [...]
projective test that has been [...] presented with a series of cards, each a
[...] picture of an ink blot, and are asked, “What might [...] be?” [...]
commonly used projective technique is the Thematic Apperception Test, in which
the respondent is asked to tell a story about each of a series of pictures.
The ink blots and pictures constitute the [...] respondent’s answers to the
questions represent his [...] of the stimuli and thus (it is hoped) reveal
personal characteristics that [...] easily ascertainable by means of more
direct questioning.

Projective techniques vary in the degree to which the [...] unstructured.
The Sentence Completion Test is generally more structured [...] the Rorschach
Test or the Thematic Apperception Test. In this approach the respondent is
asked to complete a series of [...] father made him [...].” Because the
stimuli consist of words rather than ink blots or pictures, the investigator
can focus attention on a much greater range of phenomena. For example, if
[...] interested in the values of the respondent, sentences such as those
[...] samples may help him to uncover them. The items can vary [...] assume
that the respondent will project his own characteristics (“[...] young person
doesn’t like to be told ___.”) to those which are more direct (“My favorite
___.”). The purposes of the more direct questions are more



Use of Documents 


7.1 INTRODUCTION 

Documents will here refer to any written materials that may be used as 
a source of information about human behavior. Although written materials
are involved in experiments, surveys, and observational studies at one stage
or another, attention will be focused on those materials the recording of which
has not been the result of any special effort on the part of the investigator.
This includes, for example, population censuses, previous surveys by social
scientists, health statistics, economic statistics, records of voluntary organiza-
tions, court records, documents collected by social agencies, school records,
documents collected by personnel departments in industry, legal papers, news-
paper and magazine reports, radio and television tapes, motion picture
scenarios and play scripts, fiction, songs, poetry, folklore, laws and regula-
tions, editorials, speeches, letters to the editor, pamphlets, essays, treatises
by social scientists (including histories, biographies, articles in professional 
journals, monographs, textbooks), autobiographies, diaries, personal letters, 
notebooks, and memoranda. 


7.2 THE HISTORIAN’S APPROACH TO DOCUMENTS 
The Establishment of Credibility 

Historians have developed various procedures for the analysis of docu- 
ments. One of the problems has involved the development of procedures for 
assessing the validity or truth of the written materials contained in documents.
These efforts may roughly be divided into techniques of external criticism
and those of internal criticism. External criticism has to do with procedures
for distinguishing between a hoax or misrepresentation and a genuine docu-
ment, whereas internal criticism is aimed at establishing the degree to which
a particular document is credible. Although the ultimate concern is with truth, 




SELECTED REFERENCES FOR FURTHER STUDY

Herbert H. Hyman, Interviewing in Social Research (Chicago: University of
Chicago Press, 1954). [...] interview, as well as the dynamics of the [...]
incorporated. A comprehensive review of research literature on the interview
[...].

Gardner Lindzey (ed.), Handbook of Social Psychology, Vol. I, Theory and
Method (Cambridge: Addison-Wesley [...], 1954). This comprehensive volume
includes chapters on [...] interview, sociometric techniques, observation,
and [...]. [...] treatment focuses on the available body of data about the
research [...], and many references are provided. The discussions are fairly
thorough in spite of the limited number of pages devoted to each technique.

Charles H. Backstrom and Gerald D. Hursh, Survey Research (Evanston:
Northwestern University Press, 1963). [...] readable manual dealing with many
of the practical problems faced by the survey researcher. Much of their
detailed description of [...] procedure is difficult to find outside of the
article literature, especially in [...] understandable to the uninitiated.




ally, these approaches are not so far apart as they may appear, for many 
historians are interested in going beyond the particulars to general proposi- 
tions, and they attempt to make use of available knowledge from other social 
sciences. 

As a result of these differences in orientation, documents that illustrate the
application of the first test (i.e., ability to tell the truth) provide important data
in their own right for the sociologist. The historian who is concerned with
establishing credible statements about historical events may discard state- 
ments that reveal an inability to tell the truth. Yet the sociologist tends to be 
quite interested in such statements, because they reveal the individual’s per- 
sonal definition of the situation and are consequently invaluable in arriving at 
an explanation of his behavior. For example, the following is used to illustrate 
situations that are conducive to lack of credibility from the point of view of 
the historian: 

Expectation or anticipation frequently leads a witness astray. Those who 
count on revolutionaries to be bloodthirsty and conservatives to be gentle-
manly, those who expect the young to be irreverent and the old to be
crabbed, those who know Germans to be ruthless and Englishmen to lack
humor usually find bloodthirsty revolutionaries and gentlemanly conserva-
tives, irreverent youth and crabbed old age, ruthless Germans and humorless
Englishmen.^ 

These very expectations, which may lead to statements lacking in credibil- 
ity, constitute important aspects of the individual's definition of the situation. 

Survey researchers and social scientists concerned with the analysis of docu- 
ments would do well to pay attention to the advice of the historian concerning 
the test that has to do with the willingness of the witness to tell the truth. In 
addition to the analysis of willingness, the sociologist is often in the position 
of securing external corroboration (the fourth test of the historian) from the 
statements of other individuals or on the basis of his own observations. Thus, 
for example, the technique of the focused interview prescribes that the inter- 
viewer himself be familiar with the phenomena under discussion. The his- 
torian’s technique of not relying on statements unless they are based on the 
independent testimony of two or more witnesses seems quite sensible on the 
basis of what we know of human behavior. 

In the process of collecting his own documents by means of interview
surveys, the sociologist is in an advantageous situation for obtaining such
corroboration. The fact is, however, that procedures such as the focused interview
or the interviewing of several respondents within a given family or group on
the same subject are not frequently employed. To the extent that the social 
scientist attempts to analyze the factors leading to veracity on the part of the 

* Ibid., p. 42.




USE OF DOCUMENTS [Ch. 7

of what actually happened in history, the historian typically attempts
to establish credibility, i.e., statements that are "as close to
what actually happened as we can learn from a critical examination of the
best available sources."* The tests that have developed within the context of
internal criticism are:

1. Was the ultimate source of the detail (the primary witness) able to tell
   the truth?
2. Was the primary witness willing to tell the truth?
3. Is the primary witness accurately reported with regard to the detail under
   examination?
4. Is there any external corroboration of the detail under examination?*

It should be noted that the first two tests refer to those factors that
constitute difficulties in the interview situation. A respondent may not have
sufficient insight into his own behavior or the behavior of others to be able
to provide certain kinds of information, or he may provide distorted
information in an effort to achieve social desirability. In addition,
witnesses may be too remote from the event, may be incompetent observers, or
may have paid little attention to the event, and consequently they may not
have the ability to provide credible statements. Their willingness to tell the
truth may be reduced if they are interested parties to the conclusions, or if
what they say will please or displease others, or if laws or conventions
(e.g., libel laws) require them to depart from strict veracity.

The third test has to do with secondary evidence. If statements by the primary
witness are unavailable, then statements by secondary witnesses constitute
evidence that must, of course, be evaluated for its accuracy. The fourth test
follows the general rule of historians to accept only facts that rest on the
independent testimony of two or more witnesses, and thus the historian looks
for corroboration of a given statement.


The Historian and the Sociologist

Although there are similarities between the techniques of sociologists and
historians for assessing documentary data, there are differences in emphasis.
The sociologist tends to be more concerned with analyzing statements in order
to use them to develop and test propositions that will explain and predict
human behavior in general, whereas the historian often is interested in
obtaining a set of credible statements about particular historical occurrences.


* Louis Gottschalk, Clyde Kluckhohn, and Robert Angell, The Use of Personal
Documents in History, Anthropology and Sociology, Bulletin no. 53 (New York:
Social Science Research Council, 1945, p. 35).

* Ibid., p. 38.





Statistical Documents 


Sec. 7.4] 

the controversy that raged over The Polish Peasant in Europe and America.
The ways out of this difficulty are several, as in the situation with respect to
the analysis of personal documents. One of them is to utilize these data only
within the context of discovery and not attempt to say anything within the
context of justification. Another is to define the units of investigation so that, 
in spite of the existence of only one case at one level of analysis, a number 
of cases exists at other levels. A third way out of the difficulty is to collect 
more cases or instances, and this may be done with respect to personal docu- 
ments or social systems. 

7.4 STATISTICAL DOCUMENTS 

Some Standard Sources 

Sociologists are well aware of the potential import of such data as census
statistics, partly owing to Durkheim's classic study of suicide.* Durkheim used
several kinds of documents and, in spite of the unreliability and general
inadequacy of many of them, succeeded in putting forward and testing
propositions that are still deemed to be of considerable importance. The modern social
scientist has at his disposal far more reliable documents on many more aspects 
of human behavior. 

The United States government, for example, periodically collects a great
deal of statistical information. Among these data there are the periodic
censuses of agriculture, business and industry, foreign trade, governments, and
the decennial censuses of population and housing. Summaries of much of this
information are contained in the annual Statistical Abstract of the United
States and in Historical Statistics of the United States (Colonial Period to
1957). In addition, the U.S. Bureau of the Census publishes the Current
Population Survey, based on a sample of about 35,000 households and issued
a number of times a year. (Special tabulations are usually provided by the
Bureau to researchers at cost.) Among the many other governmental
publications of interest to social scientists are the Monthly Labor Review (U.S.
Bureau of Labor Statistics) and the National Health Survey and the monthly
Indicators (U.S. Department of Health, Education, and Welfare).

Census Tract Data 

These publications have in common the aggregation of data about large 
numbers of individuals and the description of these aggregates by means of 
such statistics as frequencies, percentages, averages, and ratios. The types of 
aggregates depend on the purposes of the investigation. One unit of analysis 
that has proven to be of special importance to the sociologist is the census 

* Emile Durkheim, Suicide, translated by George Simpson (New York: The Free Press
of Glencoe, 1951).




respondents, and to the extent that he [...], the charge sometimes [...]
whether statements by respondents are valid [...] may be effectively answered.


7.3 THE SOCIOLOGIST'S APPROACH TO PERSONAL DOCUMENTS


All documents are personal in the sense that they are written by human
beings and reveal something about human behavior. Some, however, such
as autobiographies, diaries, and personal letters, tend to
reveal more of the personality and behavior of those who have written them,
and the term personal document has these connotations.

The controversy over the methods used by Thomas and Znaniecki, in The
Polish Peasant in Europe and America, [...] compromise [...]
have previously been discussed in Section [...]. [...]
that sociologists tend to pay less attention to personal documents than they
once did. One of these has to do with the difficulty of saying very much within
the context of justification on the basis of the [...].

The importance of personal documents within the context of discovery,
however, should not be neglected. Although it is possible that a given diary
may have been written with the intent to deceive the future reader, the chances
are that most such documents are not. The situation in which the individual
writes a diary seems to be one in which factors conducive to a willingness to
tell the truth tend to be stronger than they are in most interview situations,
where the social-desirability variable may operate effectively.

The argument that it is difficult to provide much evidence for general
propositions about human behavior when only a single individual is involved is
only partially true. Much depends on what is defined as the unit of analysis.
If the unit is the overall personality of the individual, the [...]
on this either as an independent or a dependent variable, and there will
in fact be only one case with its resulting limitations. If, however, the
investigator is analyzing the effects of a series of experimentally induced
changes in the individual's values or expectations on a series of choices the
individual makes, then the research no longer involves a single case.

Although sociologists tend to worry about the single case where the personal
documents of individuals are involved, there is much less tendency to
be concerned over the limitations of a single case where the focus is on a
given social system. Where only one unit is involved, whether it is a given
individual or a given social system, what can be said within the context of
justification becomes quite limited. For, in effect, the single case is
merely a "proof" by example, and proof by example (according to a
Jewish proverb) is no proof at all. Different investigators can easily find
examples to "prove" opposite hypotheses, and this fact gave rise to much of





sections of a given tract. The investigator cannot assume that, if he selects a 
small sample of individuals from a tract where the average income is high, he 
will emerge with a high average income for his sample. 
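
The point about heterogeneity within a tract can be made concrete with a small sketch. The section names and income figures below are entirely hypothetical (they are not census data); they are chosen only to show how a tract-wide average can differ sharply from the average of a sample drawn within one section.

```python
from statistics import mean

# Hypothetical incomes (in dollars) for three sections of one imagined tract.
# None of these figures come from census data; they only illustrate clustering.
sections = {
    "riverside": [15000, 16000, 14500, 15500],
    "center": [6000, 6500, 5800, 6200],
    "rail_yard": [3000, 3200, 2900, 3100],
}

# Average over every household in the tract, as a tract summary would report it.
tract_mean = mean(income for incomes in sections.values() for income in incomes)

# Average a researcher would obtain by sampling within a single section only.
section_means = {name: mean(incomes) for name, incomes in sections.items()}

print(f"tract average: {tract_mean:.0f}")
for name, value in section_means.items():
    print(f"sample drawn only from {name}: {value:.0f}")
```

Under these assumed figures the tract as a whole looks fairly prosperous, yet a small sample taken from the lowest-income section alone would suggest the opposite.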

Another important limitation is that census tract information is incomplete 
for certain purposes. For example, no data on religion is presently available, 
although it is possible to obtain rough estimates about the distribution of the 
religious groupings from other characteristics (e.g., race, ethnic background).

The researcher might also utilize such data to determine the social class 
composition of a given tract. We see, for example, that Tract MC-21 has a 
median family income of $8,080, that the median number of school years
completed for individuals 25 or over is over 16, and that there are 1136 individuals
whose occupations carry the highest social status (professional, technical, 
managerial, etc.). Tract MC-7, by contrast, has a median family income of 
$5042, a schooling median of 9.1 years, and only 81 individuals in the occu- 
pational grouping of highest status. The two tracts vary only slightly in total 
population and only somewhat in the number of individuals in the age bracket 
21-65, so that the difference in the number of individuals in high-status
occupations cannot be accounted for by the difference in the number of employed
individuals between the tracts. This assessment of the differences between the 
tracts in social class composition is in accord with other facts about the tracts:
for example, MC-21 is adjacent to Harvard University and borders the Charles
River, whereas MC-7 has numerous commercial establishments.

Many different types of studies can be done with census tract data: 

1. They can be combined with questionnaire or interview data to provide 
objective information about the social milieu of the respondent. The 
major problem here is that the census tract is not a homogeneous unit, 
so that his actual social milieu (e.g., the social class and ethnic back- 
ground of his neighbors) may vary considerably from the average for his 
tract. 

2. One can investigate the extent to which various characteristics are asso- 
ciated with one another. Thus, for example, we see a direct relationship 
between median number of years of schooling and median family income 
for tracts MC-7 and MC-21: in MC-7 both of these are relatively low, 
whereas in MC-21 both are relatively high. Data from a large number
of tracts may be examined in this way to see whether the relationship 
holds (existence of the relationship), how close it is (degree of relation- 
ship), and whether or not it can be accurately predicted. Such investiga- 
tions may be directed by hypotheses and theories just as are studies 
utilizing other types of data. 

3. These data are valuable for studies of social change. We see from the
above data that MC-7 had a population of 3055 in 1950 and a popula- 
tion of 3305 in 1960 — a gain of 8.2 percent. MC-21 moved from 3985 





tract. [...] can provide numerous [...] within a given area. To illustrate
the kinds of data available and some of the differences that may occur [...],
we may compare a number of facts about two specific tracts in the city of
Cambridge. Cambridge is part of the Boston metropolitan area, one of the [...]
metropolitan areas in the United States.* Both tracts are approximately equal
in population.


Characteristics of the Population                  Tract MC-7    Tract MC-21

Population 1950                                    3055          3985
Population 1960                                    3305          3450
Percentage of change                               +8.2          -13.4
Male population                                    [...]         [...]
Non-white population                               [...]         [...]
Total foreign stock                                [...]         [...]
Median school years completed for persons
  twenty-five or over                              9.1           over 16
Median family income                               $5042         $8,080

Age and Economic Characteristics
Persons under five                                 [...]         [...]
Persons sixty-five and over                        [...]         [...]
Persons twenty-one and over                        [...]         [...]
Professional, technical, managerial, officials,
  and proprietors                                  81            1136
Clerical and sales                                 [...]         [...]
Craftsmen, foremen                                 [...]         [...]
Operatives                                         [...]         [...]
Private household, service                         [...]         [...]
[...] The researcher who is doing a community study [...]
would generally be of considerable aid in locating groups and areas
to study. Rather than relying on less objective data, such as the
estimates of city officials or private organizations, he can turn to such census
data.


These data also have their limitations for such purposes. The census is a
decennial one, and many characteristics of the population can change within
a short time. Also, a given tract is far from homogeneous. For example,
individuals with a very high or a very low income may be clustered within certain

* These data are adapted from summary tables, based on the 1960 Census of
Population, compiled by the Research Division of the United Community Services
of Metropolitan Boston.



if we know how to put them together, we will be able to assemble the puzzle
and see the picture it forms. 

One approach which may be adopted by an investigator in the analysis 
of available data is to work with the original data, perhaps testing new
relationships on the basis of different hypotheses and theories. Such data need
not be limited to those emanating from interview or questionnaire surveys. 
The historian and political scientist, for example, usually work with many 
other types of documents. One codification of documents on a wide range of
cultures is The Human Relations Area Files, which is of special interest to 
researchers interested in cross-cultural comparisons. 

The social scientist may not wish to deal with, or have ready access to,
original source data, and he may base his investigation on published materials. 
One well-known example is a symposium* on the scope and method of The 
American Soldier (two volumes which reported on researches conducted by 
the War Department during World War II). The result led to what many
regard as important contributions to reference group theory.

The recently published inventory of social science research, Human
Behavior,* provides a genuine challenge for the social scientist. He may use it in
a relatively traditional manner — i.e., as an aid to locating published materials 
that bear on his problem. He may also use it to provide an initial test for 
hypotheses or theories, providing that such formulations are tested by the 
studies included. Where he finds contradictory results, he may seek to explain
them, perhaps through a modification of his own formulation. This may then 
be followed by a direct test of such a formulation through the collection of 
additional data. 

SELECTED REFERENCES FOR FURTHER STUDY 

Louis Gottschalk, Clyde Kluckhohn, and Robert Angell, The Use of Personal
Documents in History, Anthropology and Sociology (New York: Social
Science Research Council, 1945). Methods of utilizing personal documents are
treated separately for each of the fields, and a good deal of the flavor of these 
disciplines is conveyed. Numerous illustrations of research utilizing this ap- 
proach are presented. 

Robert K. Merton and Paul F. Lazarsfeld (eds.), Continuities in Social
Research: Studies in the Scope and Method of "The American Soldier" (New
York: The Free Press of Glencoe, 1950). This volume illustrates some of the
things that can be accomplished through the re-analysis and interpretation of 
prior studies. Of special interest to many students of methodology will be the 

* Robert K. Merton and Paul F. Lazarsfeld (eds.), Continuities in Social Research
(New York: The Free Press of Glencoe, 1950).

* Bernard Berelson and Gary A. Steiner, Human Behavior: An Inventory of Scientific
Findings (New York: Harcourt, Brace & World, 1964).






to 3450, a loss of 13.4 percent. [...] change (for example, outward migration,
rezoning, urban renewal), and the analysis of census tract data should be
supplemented by other studies for best results.
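
The percentage changes quoted for tracts MC-7 and MC-21 can be checked with a short calculation. The sketch below uses only the 1950 and 1960 populations given in the text; the function name `percent_change` is an illustrative choice, not something taken from the source.

```python
def percent_change(old: int, new: int) -> float:
    """Return the change from old to new as a percentage of old."""
    return (new - old) / old * 100.0


# 1950 and 1960 populations as quoted in the text for the two tracts.
tracts = {"MC-7": (3055, 3305), "MC-21": (3985, 3450)}

# Round to one decimal place, the precision used in the text.
changes = {
    name: round(percent_change(pop_1950, pop_1960), 1)
    for name, (pop_1950, pop_1960) in tracts.items()
}

for name, change in changes.items():
    direction = "gain" if change > 0 else "loss"
    print(f"{name}: {direction} of {abs(change)} percent")
```

The base of the percentage is the earlier (1950) population in both cases, which is why a loss of 535 persons in the larger tract is a bigger relative change than a gain of 250 in the smaller one.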

7.5 PRIOR SOCIAL SCIENCE RESEARCH 

Social scientists have been doing [...] years. For example, the amount of
survey data collected on a wide range of topics is vast.

One of the important advantages of survey techniques is the variety of
data that can be collected and brought to bear on different research
questions. Because the analysis of [...] different directions and much time is
involved in exploring any one of these, any given investigator [...]
directions. Thus vast amounts of data are analyzed only to a very limited
extent, and they constitute an important set of documents [...].

These data are located at universities and other institutions as well
as in a number of survey archives developed within the past decade. The
major archives within the [...], Berkeley, New Haven, and Williamstown. The
Roper Public Opinion Research Center at Williams College, for example, has
[...] collected by over forty research organizations, including the National
Opinion Research Center (University of Chicago); the Bureau of Applied Social
Research (Columbia University); the Office of Public Opinion Research [...];
the American Institute of Public Opinion (Gallup [...]); Crossley, Inc.; the
Opinion Research Corporation; Alfred Politz [...]; the Psychological
Corporation of New York; Elmo Roper and Associates; Market Research
Africa, Ltd.; Australian Public Opinion Polls; Institut Français d'Opinion
Publique; Nederlandse Stichting voor Statistiek; Istituto Italiano dell'
Opinione [...]; Società Italiana per le Ricerche di Mercato; Swedish Institute
of Public Opinion Research, Ltd.; and the Canadian Institute of Public
Opinion.

In seeking to analyze such data, the investigator is, [...] the
items that were incorporated in the original interview schedules or
questionnaires. If, however, he is willing to settle for items that were designed
for other purposes but that are nevertheless adequate for his own, he may find
it possible to test and further develop his hypotheses and theory within the
context of an enormous range of data. Such data record responses of
human beings to questions put to them by other human beings. If a theory is
truly general in scope, it should be able to explain a good deal of such data,
provided that adequate measures for the crucial variables can be adapted from
the original items. Each survey might be considered to be a puzzle, the pieces
of which are items used in the survey. If the pieces are, in fact, available, and














selection by Kendall and Lazarsfeld on problems of survey analysis with its
useful example of three-variable analysis.

Frank W. Moore (ed.), Readings in Cross-Cultural Methodology (New Haven:
HRAF Press, 1961). The Human Relations Area Files, with its comprehensive
collection of written materials on a wide range of cultures, provides the
researcher with a rich source of available data. This volume incorporates papers
on different aspects of the methodology of cross-cultural research.



Observation


8.1 INTRODUCTION
The Nature of Observation

Observational methods of data collection are techniques for gathering
information without direct questioning on the part of the investigator. They may
involve one or more of his senses, and methods developed thus far center on
his ability to see and hear.

Although it is possible to consider these methods separately, they are very
often used in combination with others. The interviewer, for example, may be
required to make observational ratings on the characteristics of a respondent
or his environment. If the interview is not a standardized one, his observations
of the respondent's behavior aid him in formulating questions and in
interpreting the significance of the respondent's answers. Communication seems to
proceed at a non-verbal as well as a verbal level, and the interviewer who is
alert to non-verbal cues and adept at interpreting their significance may emerge
with a far greater knowledge of the respondent than one who pays attention to
verbal responses alone. Many psychoanalysts stress non-verbal communication
as a means of detecting phenomena that the respondent might either be
unable or unwilling to impart. Observation of such behavior constitutes an
indirect data-collection method, similar to projective tests and other indirect
methods.


Relationship to Other Methods

Each data-collection method is admittedly limited and imperfect, and it is
often a combination of several methods that gives rise to a more complete
understanding of the phenomena under investigation. Such a combination of
approaches is illustrated in anthropology. Under ideal conditions the
anthropologist learns to include an analysis of available data about the society he
plans to study before collecting new data. The period during which he lives




with the people he studies affords opportunities for a wide range of
observational and [...] techniques. In recent years some anthropologists have
become interested in adding to their repertoire the [...] techniques popular
among sociologists.

Such a combination of research [...] considerable value in other social
sciences as well. For example, the sociologist who intends to conduct a survey
of a [...] is usually well advised to begin by thoroughly familiarizing
himself with the research setting. In addition to a search of available
published and unpublished materials, much can be learned by observational
techniques. [...] as well as procedures that are carefully [...]. What he
learns as a result of them may frequently be [...] verbalize. Nevertheless,
such observation has [...] productive both in the development of theory which
applies to the setting and in the construction of interview schedules and
other research tools which prove to be effective.

Useful within the context of [...], observational techniques also
constitute an important aspect of experimental [...]. For example, the
pretest and posttest of the before-and-after [...] may be based on the
observation of behavior rather than on paper-and-pencil [...] interviews.
As in the survey, careful observation [...] experimental situations
may provide insight into the limitations of the experimental procedure as well
as ideas for future study. Guthrie, discussing [...], [...] the
importance of observing very fine details in experimental and other situations:

When Skinner records how many times a rat [...] he [...]
"environmental events," or the effect of the rat's movements on the
environment. But Guthrie argues that the rat could have [...] change in
environment by use of his paw, his snout, his ear, or [...] other
"movements." That is, calling the response a bar press [...]
getting" covers up the important event, that is, whether [...]
"pawing at the bar," "a snout-press," "an ice-cream snatch," or [...].*


8.2 OBSERVATION AND VALUES 


Ability and Willingness Reconsidered

In discussions of the relationship between the interviewer and the respondent
in Chapters 6 and 7, it was pointed out that two factors that affect the
quality of data secured are the ability and the willingness of the respondent to


• William W. Lambert, “Stimulus-Response Contiguity and Reinforcement Theory in
Social Psychology,” in Gardner Lindzey (ed.), Handbook of Social Psychology
(Cambridge, Mass.: Addison-Wesley Publishing Co., 1954), p. 74.



give truthful answers to questions asked. For example, he may not possess an 
awareness of his own values or goals, or he may deliberately seek to avoid 
placing himself in an unfavorable light. It will now be useful to re-examine 
this distinction with the object of providing a more thorough treatment of 
some of the subtleties involved in order to better depict the place of observa- 
tional procedures within the research enterprise. We shall limit this analysis 
to the problem of obtaining data on the values or goals of the individual, 
although the discussion should have broader relevance. 

What is central here is the definition of goal. By conceiving of goals as ends
of which the individual is not necessarily aware, we imply that there may be 
a wide divergence between those goals he thinks he has and those he actually 
does. In other words, his self-image with respect to goals may be far from 
congruent with reality. 

This distinction between the statements of the individual and the reality 
may be seen within the context of act meaning and action meaning. With 
respect to an individual's goals, act meaning refers to the individual’s own 
conception of them, whereas action meaning refers to those goals for the 
existence of which there is scientific evidence. The collection of such evidence
is, of course, a very difficult problem.

In this light, we may distinguish between the goals that actually serve to
direct the individual’s behavior and the individual's self-image or expectations
as to the nature of his goals. The ability of the individual to respond truthfully
about his goals is manifested by the congruence between these two factors.
This is not necessarily identical with the ability to see oneself as others see
one, since others might be wrong according to the standards of scientific 
evidence. 

The willingness of the individual to respond “truthfully” may refer either
to the “truth” as the individual believes it to be or the truth as the scientist
believes it to be, and thus there is definite ambiguity in the phrase. When the
individual is unwilling to tell the truth, his reference is to the truth as he
believes it to be, since he is generally unaware of the other truth. It may even
occur that the individual, in his efforts to hide or distort the truth as he sees
it, ends up by revealing the truth as seen by the scientist. We may wonder here
whether to say that the individual is lying or telling the truth, but the appear- 
ance of such ambiguities in ordinary parlance is not unusual. 

Relevance for Observational Procedures 

The role of the researcher would indeed be a simple one if observational 
techniques automatically yielded the truth about the nature of the individual’s 
values (as distinct from the act meanings of the individual as yielded by inter- 
view procedure). Some critics of the interview technique naively assume that 
reality is far more easily discovered when the investigator turns his attention 
from words to deeds. If this were indeed the case, then we should presently 

[...] The above discussion seems to imply [...] observational procedures, since
both can yield complementary data. Interview data yield information [about]
those goals the individual is aware of. The situation might be such that he
will faithfully convey to the interviewer a detailed picture of such [goals],
or the individual might to a greater or lesser degree distort this picture [in
favor] of an image more socially acceptable.

Observational data are generally less subject [to the] conscious desire of the
individual to present a false image of himself. [...] depends on the period and
type of observation. [Over a long] period of time, and in instances where the
individual is unaware that he is being observed for scientific purposes, the
observed individual is less likely to successfully maintain such a pose.

But is it true, therefore, that observational [data] reflect accurately the
individual's conscious goals? Not necessarily, since [his conscious] goals may
markedly diverge from the goals that actually [direct] the individual's
behavior. Observational procedures are [meant to help the] investigator locate
and obtain evidence for goals of which the individual is not aware but which
nevertheless guide his behavior. The [individual may be] hiding from his own
awareness those goals he would [...] label as repugnant, but the careful
observer will hopefully be able to [identify them].

Does this mean, then, that observational data are [...] measurement of covert
goals, whereas interview data yield conscious goals? Not at all. Interviewing
is generally superior for [...] in a situation where the respondent has no
reason to [distort the] image he projects. In cases where he attempts
distortion it is quite useful [for the] researcher to be able to fall back on
observational data, preferably collected over a period of time, since it is
difficult for the individual to [maintain] conscious distortions consistently
in a series of situations.

As for the measurement of covert goals, information about overt [goals]
derived from interview data may provide a valuable supplement to observational
data. The individual may convey in the interview situation a picture of
himself as perfectly honorable, good, self-sacrificing, etc., and he may
actually believe this to be the case. Knowledge of this self-image, false
[though it may] be, can be a valuable clue to uncovering covert goals. The
[investigator can] then search for the covert reasons or goals that impel the
individual to maintain such a picture of himself. He may find that the
individual's [need to] maintain such a picture of himself in fact covertly
affects much of his [behavior], and that he will go to almost any lengths
(unrealized) of perfidy to [avoid] facing himself.




8.3 SELECTION OF PHENOMENA FOR OBSERVATION 

If the detailed movements of the organism are important to an understanding
of its behavior, the problem of selecting phenomena for observation becomes
very real. If we are investigating interaction between two individuals, do we
observe the exact sequence of physical movements made by each, the
physiological changes each undergoes during the process, the words spoken and
other sounds made by each (taking note of emphasis, intonation, pitch, and
pauses), the physical environment and how it relates to the interaction, or a
combination of all these? If an investigator wanted to observe any of these
factors, he would have to define carefully what it is that he is observing as
well as develop specific indices or measures. If he is interested in “physical
movements,” he must specify what constitutes a separate and distinct movement,
for a good deal of behavior appears to be relatively continuous. He might
decide to take sound movies of the interaction, but although this would enable
him to collect much of the data he wants, the problem of processing it would
have to be faced eventually. One of the crucial problems confronting
anthropologists has been that of effectively coping with the mountains of data
collected during field work. This problem is certainly not unique to
observational data-collection methods, for surveys and other types of data
collection can be equally prolific.

It is important to understand that the problem here is not merely one of
selecting from masses of data or quantities of discrete phenomena the items
of greatest import. Phenomena are not discrete entities; rather, it is the
observer who categorizes them on the basis of a variety of concepts and frames
of reference which enable him to organize the incoherent jumble of the real
world. Once phenomena are so categorized according to a particular set of
orientations, the result may be perfectly useless in achieving an understanding
of what has occurred. It will be recalled that a given concept may or may not
have clarity, wide scope, and systematic import; without these characteristics
it is difficult indeed to develop a theory that can effectively explain and
predict. The problem of selecting phenomena for observation seems, then, to be
intricately linked with the problem of conceptualization. If we are conducting
observations on choice of specialty in a medical school, for example, our
ideas about which factors affect those in which we are interested, and how
they all operate together, will serve to focus our attention on certain
phenomena. Because the observer is bombarded by a vast amount and variety of
stimuli, he must choose among them.

8.4 LESS STRUCTURED AND MORE STRUCTURED 
TECHNIQUES 

It may be argued that it is poor strategy to attempt to collect data by
committing oneself to a particular set of concepts and a theory and thus neglecting




to respond). Although Chapple's own procedures have not taken into account
the content of what is said or descriptions of what is done during the
interaction, there is no reason why the two cannot be combined. One of the
implications of Chapple's work is that observation which is highly detailed and
accurate may be able to reveal important factors missed by less detailed and
precise procedures.


Less Structured Observational Techniques 

Less structured procedures fall into two broad types: participant and
nonparticipant observation. The distinction has to do with the role of the
observer within the observational situation and indicates a difference in the
degree to which the observer adopts the role of member of the group.

Many of the problems and procedures of less structured observation have
been developed within the context of anthropological field work and
sociological participant observation research.* Becker and Geer have analyzed
the method of participant observation on the basis of their experiences with a
study of medical students.* They distinguish among three types of participant
observer roles: (1) the investigator may in fact be a member of the group he
studies; (2) he may pose as a member although he is not; and (3) he may
join the group in the role of one who is there to observe. They specify three
stages through which their field work progressed: (1) selection and definition
of problems, concepts, and indices; (2) measurement of the frequency and
distribution of phenomena; and (3) incorporation of the findings into a model
of the organization under study. This represents within a single study an
attempt both to develop important concepts and measures within the context
of discovery and to construct and test propositions within the context of
justification. Although their research method is initially relatively
unstructured, more structured observational techniques are subsequently
developed in order to provide evidence within the context of justification.
With this procedure, however, the investigator must be wary of several
pitfalls, which have to do with claims made within the context of
justification. One of these is the possibility of circularity when hypotheses
are both developed and tested on the same data. Another has to do with whether
the unit of investigation is the individual or the social system as a whole.
Both of these difficulties can be remedied to the extent that additional data
are collected.
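
As a rough illustration of how the circularity problem might be handled when additional data are hard to come by, the observations can be divided before any hypotheses are formed, so that propositions developed on one part are tested on the other. This is a swapped-in technique (a simple hold-out split), not a procedure described by Becker and Geer. The function name `split_field_notes`, the record format, and the half-and-half split are all assumptions made for the sketch.

```python
import random

def split_field_notes(records, explore_fraction=0.5, seed=0):
    """Randomly divide observation records into two parts.

    The first part is used to develop hypotheses (context of discovery);
    the second is held back to test them (context of justification).
    Hypothetical helper: names and the default split are assumptions.
    """
    rng = random.Random(seed)      # fixed seed so the split can be repeated
    shuffled = list(records)       # copy, so the caller's list is untouched
    rng.shuffle(shuffled)
    cut = int(len(shuffled) * explore_fraction)
    return shuffled[:cut], shuffled[cut:]

# Hypothetical field notes, one entry per recorded observation.
notes = [f"observation {i}" for i in range(10)]
explore, holdout = split_field_notes(notes)
```

The essential design choice is that the split is made before the investigator examines the records, so that nothing learned from the held-back part can shape the hypotheses.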

Although the Becker and Geer study illustrates a situation in which the
researchers join the group in the role of observers, a participant observation
study by Roth* provides an example of a study in which the investigator is
in fact a member of the group. Shortly after receiving his doctorate in
sociology, Roth had to be hospitalized for a tuberculosis condition. Unknown to


* Richard N. Adams and Jack J. Preiss (eds.), Human Organization Research: Field
Relations and Techniques (Homewood, Ill.: The Dorsey Press, 1960).

* Howard S. Becker and Blanche Geer, “Participant Observation: The Analysis of
Qualitative Field Data,” in Adams and Preiss, op. cit., pp. 267-89.

* Julius A. Roth, Timetables (Indianapolis: The Bobbs-Merrill Company, 1963).



the staff and other patients, he utilized his stay to carry on relatively
unstructured observation with respect to the social structure of the hospital.
Roth’s observations covered a wide variety of aspects of hospital life, but the
analysis and interpretation of these data (as revealed in Timetables) is
focused on the structuring of the passage of time within the hospital setting.
“The first impression one gets of the TB patients’ concern with time,” Roth
observes, “is that everyone is frantically trying to find out how long he is in
for.” • Furthermore, patients are constantly demanding to know when they will
be permitted more activities, when they will get a pass, when they will get
surgery, when they will be given another conference, and so on. After his
analysis of the timetables or norms that structure the passage of time in the
tuberculosis hospital, Roth proceeds to compare them with timetables in other
settings, such as the mental hospital, the penal institution, the educational
system, and the occupational career. Roth’s study provides an example of
research that begins at a relatively low level of abstraction and ends at a
relatively high level. Although he seems to have had very little in the way of
specific hypotheses to test at the initiation of the study, his conclusions
with respect to the mechanism for the development of norms that structure time
are sufficiently abstract to have very wide scope. They help to point the
researcher in the direction of analyzing the dynamics of situations in general,
and they provide clues as to what to look for when analyzing changes.

SELECTED REFERENCES FOR FURTHER STUDY 

Oscar Lewis, The Children of Sanchez (New York: Random House, 1961). The
author, an anthropologist, makes use of the results of his years of study and
observation in Mexico to focus attention on the life stories of four individuals.
This illustrates the effectiveness of the interview procedure when it is directed
by thorough knowledge based on a variety of data-collection techniques.

Julius A. Roth, Timetables: Structuring the Passage of Time in Hospital
Treatment and Other Careers (Indianapolis: The Bobbs-Merrill Company, 1963).
Most of this volume is taken up with a participant-observation study of several
tuberculosis hospitals, with the remainder given over to an analysis of other
career timetables, e.g., the army, the educational system, the business executive.
In all these institutions the passage of time is structured, and this structuring
has significance for social processes in general.

Richard N. Adams and Jack J. Preiss (eds.), Human Organization Research:
Field Relations and Techniques (Homewood, Ill.: The Dorsey Press, 1960).
This collection of readings emphasizes research in non-Western societies, with
most of them having to do with interviewing and observation in a field, as
distinct from a laboratory, setting. The authors of the readings are primarily
anthropologists and sociologists, and their coverage often encompasses the
analysis as well as the collection of data.

• Ibid., XVI.




Simulation


9.1 THE NATURE OF SIMULATION 

Simulations are models or analogues of some aspects of phenomena. They
may be physical or symbolic models, or some combination of the two, but
they are also operating models and, as such, they generate data. In Section
3.2 an example was presented of a simulation of the choice process in general
and of occupational choice in particular, and the suggestiveness of such
simulations within the context of discovery was illustrated. The method has
recently come to be one of major importance within the social sciences.

The fact that a model is an operating model and thus produces data of its
own has broad implications for the understanding of processes. This may be
best understood if simulation techniques are compared with the data-collection
methods already discussed. We might also examine their contribution in the
context of justification as well as in the context of discovery. First, however,
it is important to understand that simulation, whatever else it may be, is a
method for generating different types of behavior and that it is possible to
utilize any or all of the previously discussed techniques in the analysis of such
behavior. Thus, within the context of a simulation, one might proceed to
experiment or interview or observe or do secondary analyses of data originally
collected for other purposes. Simulation techniques therefore generally do not
compete with the more traditional methods, which are utilized to analyze the
data produced by the simulation.

9.2 SIMULATION AND THE EXPERIMENT 
Some Differences 

Experiments usually are undertaken when the investigator has formed very
clear-cut hypotheses and when it is feasible for him to manipulate one or more
independent variables in order to analyze their effects on one or more
dependent variables. A high degree of control attainable in the experiment
[tends to] make the inferences of the experimenter more definitive than those
usually made in other methods of investigation. The chief advantage of [the
experiment] seems to be in the context of justification. But serious questions
may be raised as to just how much we do understand of what is going on even in
highly controlled experiments. Although we can control for the effects of the
pretest on the subjects, how much do we thereby learn about the dynamics of
such effects? The same can be said with respect to the interaction effects of
the pretest on the test variable or treatment. How much do we learn about human
behavior when we find, for example, that a given treatment leads to a certain
degree of change in the average performance of a number of individuals?
Of course, it all depends on the theoretical importance or fruitfulness of the
proposition under investigation. But let us assume that the proposition is
potentially of great importance. How much is our understanding increased by
the experimental data in non-simulated situations?

In order to understand the basis for these questions, it is necessary to
reconsider the role of scientific concepts and theory in understanding
everyday phenomena. The layman often views observations as direct indicators
of reality. The behavioral scientist, however, is well aware that act meaning
is not action meaning, and that it is in the concepts, propositions, and
theories developed by the scientist that greatest hope for understanding and
predicting phenomena lies.

Let us suppose now that a scientific theory has been developed which seems
to offer great promise for understanding various aspects of behavior. Even if
we perform highly controlled experiments relating to any given aspect of the
theory, we are neither testing the theory as a whole nor maximally using the
theory itself to understand and control what is going on in the experimental
situation. If the scientific theory does indeed constitute our best mechanism
for understanding the reality, then it may be useful to attempt to apply it as
completely as possible. Even highly controlled experiments can be further
controlled by the application of theoretical knowledge within the experimental
situation itself. This may lead both to improved ways of testing the
theory as a whole (context of justification) and additional ideas as to what
may be actually going on during the experiment.


Experimenting Within the Context of Simulation

Simulation implies the introduction of methods that make use of available
theoretical knowledge to control the data-collection situation. The game
“Careers” centers on three factors: the values or goals each individual adopts
within the game; the expectations for achieving the various goals each player
forms as a result of reading the descriptions of the squares on the board; and
the choices that have to be made by each player throughout the game. If it
is indeed true that choice is primarily a function of values and expectations,
then the game situation serves to sweep away the thousand-and-one extraneous
factors that get in the way of the researcher when he tries to determine
what actually occurred in a given experiment. The simulation makes use of
available theory to structure the experimental situation and produce very
detailed data on just those relationships the investigator believes are most
important for exploration and testing. If, for example, he is interested in the
ways in which different kinds of expectations affect choices, then he might
proceed in the following manner within the framework of a one-control-group
before-and-after design:

1. Using randomization procedures, divide subjects into an experimental 
group and a control group. 

2. Match or randomize all other conditions for these two groups. 

3. Give each group the same pretest by allowing them to play with one
version of “Careers” and record the choices made.

4. Introduce a different version of the game for the experimental group,
one which incorporates certain changes in the descriptions of the squares
on the board and which consequently will result in differences in
expectations for achieving the various goals.

5. Record the choices made by the experimental group (posttest) and also 
allow the control group to play with the original version of the game 
once more and record their choices. 

One may consequently draw inferences about the relationship between the
changes in the descriptions of the squares (expectations for achieving goals)
and the choices made by subtracting changes in the control group from those
in the experimental group.
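
The inference just described reduces to simple arithmetic: the change in the control group is subtracted from the change in the experimental group. The Python sketch below is illustrative only. The function name `treatment_effect`, the scoring of choices as a count per player, and all of the numbers are hypothetical and are not taken from any actual run of the game.

```python
from statistics import mean

def treatment_effect(exp_pre, exp_post, ctl_pre, ctl_post):
    """Change in the experimental group minus change in the control group.

    Each argument is a list of scores, one per subject (here, a
    hypothetical count of some kind of choice made during a game).
    """
    exp_change = mean(exp_post) - mean(exp_pre)   # pretest to posttest, experimental
    ctl_change = mean(ctl_post) - mean(ctl_pre)   # pretest to posttest, control
    return exp_change - ctl_change

# Hypothetical scores for three players per group.
effect = treatment_effect(
    exp_pre=[2, 3, 4], exp_post=[5, 6, 7],   # experimental group: mean 3 -> mean 6
    ctl_pre=[2, 3, 4], ctl_post=[3, 4, 5],   # control group: mean 3 -> mean 4
)
# effect == 2: of the 3-point change in the experimental group, 1 point
# also appears in the control group (for example, an effect of replaying).
```

Subtracting the control group's change is what removes the influence of simply playing the game a second time; whatever difference remains is attributed to the altered descriptions of the squares.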

9.3 SIMULATION AND CLOSURE 

Simulation procedures, as they have been utilized thus far,* generally focus
on the achievement of closure by taking into account those phenomena that
seem to be most important for understanding the process under investigation.
This differs from most kinds of laboratory experiments, which focus on only
one or a few aspects of a given process. The “Careers” simulation, for
example, includes within its purview choices, values, and expectations for
fulfilling them. What is needed is not geographical isolation but, rather, the
inclusion of whatever factors affect the phenomena under investigation. Once
these factors have been included and data on them emerge from the simulation,
the way is open to discover and test the mechanism whereby these factors
interact with one another.

* For a review of simulation procedures, see Harold Guetzkow (ed.), Simulation in
Social Science: Readings (Englewood Cliffs, N.J.: Prentice-Hall, 1962).




9.4 SIMULATION AND LEVEL OF ABSTRACTION 

One of the important features of simulation is that it usually entails a large
gap between the surface characteristics of the simulation situation and the
characteristics of the process simulated. There is, thus, a high degree of
abstraction involved, i.e., a selecting out from the numerous characteristics
of the actual process those which seem to be most crucial. The “Careers” game
provided an example of this abstracting process.

The simulation of international relations furnishes an illustration in which
this process is carried forward much further. For example, nations must be
“constructed,” with the complexities of their internal and external affairs
somehow taken into account. The student of social research may well question
the utility of such “unrealistic” simulations. Perhaps answers will emerge as
a result of examining one of them.

Guetzkow has developed a simulation in which each nation is represented
by one or more decision makers.* In one situation there are two decision
makers, the internal decision maker (IDM), with final responsibility for all
of the nation’s policies, and the external decision maker (EDM), who conducts
foreign policy and reports back to the IDM. In one series of runs of
the simulation, senior and graduate students concentrating in political science
were used, and it was found that some of the IDMs collaborated with their
EDMs in making decisions whereas others did not. Another series showed
that nations with a single decision maker tended to pay little attention to
internal affairs while those with three frequently became bogged down in
internal processes, with little time remaining for external affairs.

Goals for each nation were specified, sometimes by the researchers (e.g.,
security, domination, cooperation, internal growth) and sometimes by the
students, and each nation attempted to achieve its goals. Decision makers
were also concerned with the goal of being maintained in office, with the
probability of this being dependent on the degree to which the nation's goals
were being fulfilled. Such calculations were made by researchers only at
certain intervals, giving decision makers the opportunity to take long-range
considerations into account in their efforts to fulfill the nation’s goals.
Nations were subsequently allowed to reformulate their goals.
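
The office-holding rule can be pictured with a toy model. The sketch below is not Guetzkow's actual procedure: the assumption that the probability of retention simply equals a goal-fulfillment score between 0 and 1, the function names, and the review interval are all hypothetical. It is meant only to show why infrequent review leaves room for long-range planning.

```python
import random

def stays_in_office(goal_fulfillment, rng):
    """Return True if the decision maker is retained at a review.

    goal_fulfillment is a score in [0, 1]. The retention probability is
    ASSUMED, for illustration only, to equal that score.
    """
    return rng.random() < goal_fulfillment

def first_removal(fulfillment_by_period, review_every, seed=0):
    """Return the period in which the decision maker is removed, or None.

    Office-holding is reviewed only every `review_every` periods, so poor
    fulfillment in an unreviewed period carries no immediate penalty.
    """
    rng = random.Random(seed)
    for period, score in enumerate(fulfillment_by_period, start=1):
        if period % review_every == 0 and not stays_in_office(score, rng):
            return period
    return None

# Goals neglected in period 1 but fully met by the review in period 2.
recovered = first_removal([0.0, 1.0], review_every=2)
# Goals still unmet when the review arrives.
unmet = first_removal([0.0, 0.0], review_every=2)
```

In the first case the decision maker survives despite an early failure, because no calculation is made until the second period; in the second case removal occurs at the review.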

At intervals, each nation received basic resources which could be used for
internal or external purposes. These resources could aid in internal growth,
for example, or they could be used for purposes of dominating other nations.

Nations could interact in many different ways. They could build permanent
forms of international cooperation or they could declare war. They could
seek to establish bilateral or multilateral alliances or refrain from such efforts.

* Harold Guetzkow, “A Use of Simulation in the Study of Inter-Nation Relations,”
Behavioral Science, 4, 183-191 (1959).




Treaties could be made specifyiog an exchange of resources, or resources 
could be given by one nation to others. 

As yet so little simulation in this area has been done that it is difficult to
assess the utility of these efforts. However, we are able to specify some of the
ways in which they might be useful (other than as training devices):

1. They may provide descriptions of international situations that have
never existed but may occur in the future. For example, Kaplan discusses
six distinct international systems, with only the first two having
actually had historical counterparts: “(1) the ‘balance of power’ system,
(2) the loose bipolar system, (3) the tight bipolar system, (4) the
universal system, (5) the hierarchical system in its directive and nondirective
forms, and (6) the unit veto system.” * Data from simulations may aid
the investigator in speculating about the range of possible systems that
might occur and about the possible characteristics such systems might
have. Here, simulation would offer the possibility of aid within the
context of discovery.

2. They might lead to the development of specific hypotheses about the
course of international events, and these may subsequently be tested by
actual occurrences. Most social scientists would agree that we are still
a long way from being able to make highly specific and accurate
predictions due to the complexity of international relations relative to the
effectiveness of available theories. One of the major difficulties has to
do with our ability to pull together theories about the behavior of
individuals and theories of group processes.

Yet, from the Guetzkow simulation we can see that it is possible to 
specify the relationships between individual and group processes in 
considerable detail. A given society makes its influence on the decision 
makers felt by means of the relationship between the degree to which 
national goals have been achieved and the probability that these decision 
makers will be maintained in office. The decision makers, in turn, can 
modify these goals to an extent in accordance with their own goals. They 
can also fail to achieve national goals in the short run and still remain in 
office due to the infrequency of ‘‘elections.” In making predictions about 
specific nations, it would be necessary to have accurate information about 
the goals of the key decision makers as well as of the nation as a whole. 

It may easily be maintained that those simulations thus far investigated 
neglect to include any of the important complexities of international 
process. Nevertheless, it is possible to increase the complexity of these 
simulations. If predictions based on a given simulation prove to be 
highly inaccurate, it may be possible to discover those limitations in the 

* Morton A. Kaplan, System and Process in International Politics (New York: John
Wiley and Sons, 1957).


simulation leading to this inaccuracy and to modify the simulation
accordingly.

3. A goal for social [science which may be] more easily attainable is an
understanding of the ways in [which the] “rules” of international
relationships affect one another, [in] relationship to national goals.
This does not involve accurate [prediction of] events. The emphasis here
is on the [...] international system, with the variation in the goals of
individual decision makers [not] being [...]. Simulation could provide
hypotheses [...] setting them up as game situations [with] different sets
of goals. Any one game situation might [be played by different sets of]
individuals acting as decision makers. [Although there would] probably
be a great deal of variation in [what occurs, depending on which] decision
makers were playing the game, there [might also be a great] deal of
uniformity in what occurs. This uniformity [might provide] the basis
for propositions about the process of international [relations].

4. It is always possible to view such [simulations without] attempting to
draw conclusions about the process [of international relations]. The
emphasis could, instead, be on explaining and predicting the behavior of
the set of individuals who are playing the game. [This would be no]
mean achievement, since it could make important contributions to general
theory.

9.5 COMPUTER SIMULATION 

Simulation is well adapted to exploration, for simulation of [a process]
usually makes it possible and convenient to alter a variety of [factors]
to see what the results will be. We can, for example, [change the rules] of the
game in various ways without necessarily having a very [specific hypothesis]
in mind. In one type of simulation, known as computer [simulation, it is]
especially easy to create variations and quickly discover the [implications].
Under this method, the total system of rules of the game, as well as
mathematical or logical formulations of (for example) the ways in which
subjects might [make] choices on the basis of the rules of the game, are
programmed [for an] electronic computer. Such programs can be written so that
it is [easy] to alter rules of the game or formulations of the choice process
[and examine] the consequences in terms of the kinds of choices that [result.
In this manner] the investigator might examine a very wide [range of theories
of] choice behavior in a relatively short period. The researcher states these
[theories in] mathematical or logical terms and then goes on to test them
against data based on the choices of live individuals.
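
A minimal sketch of what such a program might look like follows, written in Python for illustration. It assumes one particular formulation of the choice process, namely that a player picks the option whose expectations, weighted by his values, sum highest. The function name `choose`, the goal labels, the option labels, and every number are hypothetical and are not taken from the game or from any published simulation.

```python
def choose(options, values, expectations):
    """Pick the option with the highest value-weighted expectation.

    values:       goal -> importance the player attaches to that goal
    expectations: option -> {goal -> expected chance of achieving it}
    ASSUMED rule (one possible formulation of the choice process):
    score = sum over goals of value * expectation.
    """
    def score(option):
        return sum(values[goal] * expectations[option][goal] for goal in values)
    return max(options, key=score)

# Hypothetical expectations a player might form from reading the board.
expectations = {
    "business": {"income": 0.8, "prestige": 0.3},
    "teaching": {"income": 0.3, "prestige": 0.7},
}
options = list(expectations)

# Same rules of the game, two different sets of values.
income_minded = choose(options, {"income": 2.0, "prestige": 1.0}, expectations)
prestige_minded = choose(options, {"income": 1.0, "prestige": 3.0}, expectations)
# income_minded == "business"; prestige_minded == "teaching"
```

Altering a rule of the game corresponds to editing the `expectations` table, and altering the theory of choice corresponds to replacing the body of `score`; either change can be made and its consequences inspected in moments, which is the convenience the text describes. Predictions generated this way would still have to be compared with the choices of live players.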

The distinction between computer simulation and [...] simulation
suggests that simulations can vary in the degree to which phenomena are
directly structured or controlled by a given theory. For example, a simulation
of the occupational-choice process can be highly unstructured, with each
individual constrained by very few rules in making his choices and free to
behave in many different ways. Such a simulation might be very valuable
within the context of discovery, especially if the investigator would like to
obtain an understanding of the range of behaviors possible within the choice
process while having relatively few definite propositions in mind. If he
formulates a systematic theory, the investigator can structure the situation
more closely to the propositions of his theory, constraining the individual to
operate within the confines of a variety of rules. One advantage of such an
approach is that it allows the investigator to focus detailed attention on
those aspects of the process that are in need of exploration while utilizing
whatever understandings he already has to structure the remainder of the
process. As he continues to develop effective theory, he can structure more
and more of the process until he is able to simulate the entire process on a
computer.

Thus, simulation allows the investigator to maximize the utility of whatever
knowledge he does possess. By using this knowledge to set up the simulation,
he achieves a genuine integration of his theoretical knowledge with his
research procedures. If he is able to develop this kind of knowledge further,
he can go on to the detailed investigation of phenomena he does not yet
understand. His theories are put to the test with respect to his ability to
understand and predict behavior in the simulated situation and also in more
naturalistic settings.
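
The idea of a program in which the formulation of the choice process can be
altered and the consequences observed can be sketched in a few lines of
Python. This is only an illustration under stated assumptions: the fields, the
two goals, the payoff figures, and both choice rules are hypothetical and are
not taken from the medical student study.

```python
import random

# Hypothetical payoffs: how far each field satisfies each goal (0 to 1).
FIELDS = {
    "pathology": {"research": 0.9, "patient_contact": 0.1},
    "surgery": {"research": 0.3, "patient_contact": 0.6},
    "pediatrics": {"research": 0.2, "patient_contact": 0.9},
}


def weighted_rule(goals, rng):
    """One formulation: pick the field with the highest goal-weighted payoff."""
    def score(field):
        return sum(goals[g] * FIELDS[field][g] for g in goals)
    return max(FIELDS, key=score)


def random_rule(goals, rng):
    """A baseline formulation: pick a field at random, ignoring goals."""
    return rng.choice(sorted(FIELDS))


def simulate(rule, n_players=1000, seed=0):
    """Run one simulated game and count how often each field is chosen."""
    rng = random.Random(seed)
    counts = {field: 0 for field in FIELDS}
    for _ in range(n_players):
        # Each simulated player receives randomly drawn goal weights.
        goals = {"research": rng.random(), "patient_contact": rng.random()}
        counts[rule(goals, rng)] += 1
    return counts


if __name__ == "__main__":
    # Swapping the rule is the programmed analogue of altering the
    # formulation of the choice process and examining the consequences.
    for rule in (weighted_rule, random_rule):
        print(rule.__name__, simulate(rule))
```

The distributions of choices produced under each rule could then be compared
with the choices of live individuals.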

9.6 SIMULATION AND SEQUENTIAL RELATIONSHIPS 

Simulation plays an important role in the analysis of sequences of phenomena
or processes. Usually, whatever change is investigated in the experiment is
introduced by the experimenter in the form of the test variable or treatment.
In the before-and-after experiment, for example, it is the difference between
the pretest and posttest situations that is under investigation. Simulation
procedures, although they can be combined with such experimentation, generally
involve a much more detailed concern with the analysis of change. When the
attempt is made to simulate an entire process, such as that of occupational
choice, a typical procedure would be to collect data having to do with many
different stages in the process. Each individual might make a long series of
choices, and the situations within which the choices are made may vary greatly
from the beginning of the process to its conclusion. For example, the degree
to which each player has fulfilled each of his goals is continually changing,
and this in turn will tend to affect the degree to which he will strive for the
fulfilled goals, on the one hand, and the unfulfilled goals, on the other.
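
A sequence of this kind, in which each choice changes the situation facing the
next choice, might be traced as in the following sketch. The striving rule
(importance multiplied by the share of the goal still unfulfilled), the goal
names, and the size of the gain per choice are all assumptions made for
illustration only.

```python
def run_sequence(importance, gain=0.25, steps=8):
    """Trace one player's choices over a series of steps.

    `importance` maps each goal to a weight. At every step the player
    pursues the goal for which his striving is currently strongest.
    """
    fulfilled = {goal: 0.0 for goal in importance}
    history = []
    for step in range(steps):
        # Assumed rule: striving falls as a goal becomes more fulfilled.
        striving = {g: importance[g] * (1.0 - fulfilled[g]) for g in importance}
        choice = max(striving, key=striving.get)
        fulfilled[choice] = min(1.0, fulfilled[choice] + gain)
        # Record a snapshot so every stage of the process can be examined.
        history.append((step, choice, dict(fulfilled)))
    return history


if __name__ == "__main__":
    for step, choice, state in run_sequence({"income": 0.7, "prestige": 0.3}):
        print(step, choice, state)
```

Because a snapshot is stored at every step, the investigator has data on each
stage of the process rather than only on a before and an after measurement.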

The question might be raised as to whether so many changes do not lead
to the kind of relatively uncontrolled situation often found in field
experiments. Actually, the structuring of the situation achieved by simulation
procedures, on the basis of whatever [theory is available, provides] an
important set of controls. Although several [factors are] generally allowed to
change over [time] in the simulation as well as in the field [experiment], in
the simulation we know [what is being] varied or how much [...].

The importance of a knowledge [of] process must not be underestimated.
Explanations attempt to [...] relationships by filling the interstices between
events or structures [within a theoretical frame]work. A knowledge of process
makes it possible to [fill these] interstices and provide the more inclusive
[...] understanding of what is going on. It is one thing to [know that, when
the] experimenter introduces a certain change, a given effect will follow; [it
is] another thing to be able to learn about the various mechanisms [that
intervene between] the given change and the given effect and convert the change
into the effect. [The] experiment can, of course, focus on some of these
[mechanisms and is valu]able for arriving at an explanation. [Simulation
techniques], however, are especially well adapted to [...]. Observational and
interview [...] provide the means for making frequent measurements.

SELECTED REFERENCES FOR FURTHER STUDY 

Harold Borko (ed.), Computer Applications in the Behavioral Sciences
(Englewood Cliffs, N.J.: Prentice-Hall, 1962). Many different examples of
computer simulation are provided: the simulation of visual displays,
cognitive processes, language behavior, musical composition, the operation
[...] of nerve nets, biomedical systems, the functioning of large
organizations, business processes, and international relations.

Harold Guetzkow (ed.), Simulation in Social Science: Readings (Englewood
Cliffs, N.J.: Prentice-Hall, 1962). In addition to a [general introduction] to
simulation, this volume contains a collection of articles illustrating [the
use] of simulation in psychology, sociology, political science, [...] business,
education, industrial engineering, and military operations. M[ost of] the
articles deal with non-computer simulation; a few illustrate computer
[simulation].

Guy H. Orcutt et al., Microanalysis of Socioeconomic Systems (New York:
Harper and Brothers, 1961). The author presents a book-length [treatment] of
one particular approach to simulation. The book deals with the simulation of
large-scale systems. Such efforts involve the construction of [the parts of
such] systems as well as the coordination of these parts into an integrated
system.




Measurement and Scaling

Scientific research is concerned with moving back and forth among levels 
of abstraction. It deals with the development of theory as well as the collection 
of data that provide the measurements necessary to test the theory. Without 
measurement, there can be no scientific testing, and theory remains in the area 
of speculation or opinion. 

If measurement is important for testing theory, it is also true that theory 
is necessary for measurement. The theory may be implicit or explicit, but 
some idea of the nature of the world must enter into any attempt to measure 
a portion of that world. If the theory is an effective one, it will aid in obtaining 
accurate and precise measurement by providing explicit directions as to where
or how to look for important phenomena. Theory may also provide the means 
for structuring the research situation, as in simulation techniques, so that 
data providing measurements of important variables are readily available 
from the research situation. 





Chapter 10 discusses fundamental questions [of measurement, including]
validity, reliability, precision, and scales. [The chapters that follow] take
up the different types of [scales and the] procedures that may be used to
construct such scales. The scales are classified into nominal, ordinal,
interval, and ratio types, and their mathematical properties as well as the
functions they serve are discussed.



Principles of Measurement and Scaling


10.1 STRATEGY OF MEASUREMENT
Why Measure? 

One important function of measurement is that it enables the researcher to
test his propositions and theories. Measures indicate when a given entity, such
as John Smith’s preference for pathology, is present or absent in a given
situation. Measures usually also indicate degrees, e.g., how important
pathology is for John Smith. Scientists utilize objective measurement
procedures so that each can compare his view of the world with that of other
scientists. Although it is generally recognized that these procedures do not
necessarily produce a picture of reality, scientists assume that this is the
best way of getting as close to reality as possible.

Measurement, then, functions to tie any concept (no matter how abstract)
to reality by telling the scientist what his observations imply about the
phenomena to which the concept refers. Measurement thus sets the stage for
testing a given proposition by allowing the investigator to make observations
bearing on the various concepts involved. When this is done, a given
hypothesis can be empirically tested and conclusions can be drawn, on the
basis of the available evidence, with reference to its truth or falsity.

Measurement is also of aid within the context of discovery. The process of 
constructing the measurement very often can influence the researcher to
develop his ideas about the concept in question more explicitly in order to be
able to build a more accurate measurement. In addition, discovery is advanced 
as a result of the testing of propositions, for the results indicate the next steps 
the investigator should take. 

The Measurement Process as Data 

For the behavioral scientist, an understanding of the measurement process
may be an end in itself. For example, an understanding of the interaction
between subject and experimenter can furnish ideas about the process of
social interaction (context of [discovery) or provide tests of] hypotheses
about social interaction [(context of justification). This is also] true for
the measurement process, for we may [think] of this as a more detailed way of
looking at certain aspects [of the] data-collection process.

Let us consider the measurement of values [within the] context of the medical
student study. It might be hypothesized that if certain entities represent the
goals of the individual, he will be able [to distinguish] them from entities
that do not represent his goals. [It might also be] hypothesized that the
individual will verbalize or record [these] distinctions in an appropriate
interview or questionnaire situation. [The respondent may be] unable or
unwilling to indicate a given fact. This may [apply] particularly to fairly
abstract concepts, which are often surrounded by a variety [of] pressures.
The first hypothesis, however, states [that the individual is able to
distin]guish his own values; the second states that he will be willing [to]
verbalize or record those values.

There is a direct, a partially direct, and an [indirect method for] testing
these hypotheses. The direct method involves a study [...] designed to test
each of them. Such a study may, of course, be [included] within a study of
other aspects of the respondent's behavior, such as [how he] chooses a field of
medicine. Using the first hypothesis, for [example, we] might adopt some
procedure for measuring goals other than [...] questionnaires or direct
questions. This might involve projective or other [indirect] techniques, or
observation of choice behavior in simulated situations. [A procedure] must
then be devised for measuring the respondent's [awareness of his goals]. If
we are to distinguish between the two hypotheses, it will be necessary to
measure awareness by means other than a questionnaire or interview, because
the second hypothesis deals specifically with the willingness of the respondent
to verbalize such awareness in those situations. Thus, an independent [means]
of getting at awareness must be found if the second hypothesis is to be tested.
This might involve a prolonged observation period in a non-interview [context]
or possibly the individual’s indications of his awareness within the [context]
of a simulated situation. Once the respondent's awareness of values [has been]
measured, it can be related to his actual verbalization of such awareness [in]
questionnaire or interview situations.

A partially direct method might simply measure the individual's goals
(whether or not they are in his awareness) by indirect techniques and [then]
measure whether or not these goals are verbalized in an interview or
questionnaire situation. The variable of awareness intervenes between the
[existence] of goals and their verbal expression, but this partially direct
approach [assumes] that if goals exist and if they are verbalized, then they
also occur in the awareness of the individual.

One indirect method actually involves no special study of the measurement
situation. The investigator simply proceeds with his analysis of the
relationship between the individual's values and his choice of medical fields,
utilizing the interview or questionnaire measures of values. If the investi- 
gator is able to explain or predict the respondent's choice of field rather well, 
then this achievement indirectly implies that his procedures for measuring 
values are effective to an extent. According to this approach, it might be more 
generally stated that whenever propositions having at least a degree of support 
in the literature are confirmed through certain measurement techniques, the 
effectiveness of these techniques is to an extent demonstrated. 

An examination of the measurement process itself, then, can add to the 
body of knowledge within the context of justification through direct or indirect 
tests of hypotheses. It can also aid within the context of discovery. Instead of
approaching the measurement situation with a view to testing specific
hypotheses, or after doing so, we might try to discover the various factors
that transform the goals of the individual into the neat answers he gives in a
questionnaire or interview situation. Although we may surmise that the individual’s
awareness of his own goals is probably biased in the direction of social desira- 
bility, there exists little in the way of research to show us the dynamics of 
this process. The researcher might proceed by measuring the social desirability 
of various values as seen by a given respondent and then try to determine, 
using relatively unstructured techniques, the mechanism whereby expectations 
about social desirability affect the individual’s awareness of his own values. 
As for the verbalization of awareness (i.e., the willingness of a respondent
to present an undistorted picture of himself), the investigator might utilize
some technique other than direct questions. Prolonged association with and 
observation of respondents might be one approach. Another might be the 
experimentation with a variety of different interview or questionnaire tech- 
niques in order to develop one that maximizes the willingness of the respondent 
to reveal his values. In such ways, the investigator might learn a good deal 
about those factors that increase and decrease the likelihood of distortion. 

Theory and Measurement 

It seems rather obvious that measurement and theory are integrally linked
and that unless measurement furthers the development and testing of propo- 
sitions and theories (whether within or outside the measurement situation 
itself) it has little utility. One implication of this point of view is that measure- 
ment is a tool for achieving effective theory, and that the precision of the 
measurement is no guarantee of its utility. We can frequently measure the 
age of individuals almost to the minute, and yet such precision has thus far
produced nothing very remarkable with respect to scientific propositions and 
theories having considerable scope. 

Does this mean that it is useless to try to obtain precise measurement? 
Not at all, for precision might substantially aid in the development of theory. 
Without precision, it becomes difficult to state relationships between variables
in a clear-cut manner, and this hampers the development of theory. [Precision]
alone, however, implies neither scope nor systematic import.

Some Choices and Strategies

Perhaps the most important choice of the investigator [is] the choice of what
to measure. Elaborate and precise measurement techniques applied to trivial
concepts will be worse than useless because [they represent a waste of]
resources. Even very rough [measurement techniques] may yield interesting and
even important data when they [are] applied [to important con]cepts,
propositions, and theories. Most [investigators seek to measure] concepts
that are important with reference to existing theory [or] the development of
theory. Thus far, however, the [fruits of the] efforts that have gone into
measurement have been rare, although [it is] true that many propositions with
relatively limited scope have been, [to an extent,] validated.

Another choice facing the investigator is the degree [to which he will focus]
on an analysis of the measurement process itself. Several [approac]hes to such
analyses have been discussed, and any forthcoming [results can add] to our
knowledge of the dynamics of human behavior within [the] measurement
situation and aid in the development and testing of improved [propositions
and] theory outside of the measurement situation as well.

A third choice faced by the researcher involves [the selection of a] scaling
procedure. Each procedure carries with it certain assumptions [about the]
nature of the phenomena under investigation, and each produces a [given]
degree of precision. The number of assumptions made and the degree of
precision generally vary together, so that a greater number of assumptions
[tends to] yield a greater degree of precision. But assumptions may be
erroneous, and the researcher who seeks a great deal of precision may make
assumptions [that] are far from correct. On the other hand, if he attains a
high degree of precision on the basis of assumptions that are only partially
incorrect, he may [gain] a good deal of theoretical knowledge and subsequently
be in a better position to improve his measurement procedures and eliminate
the errors involved.

The American Psychological Association has been particularly concerned
with problems of measurement within the context of the construction of
psychological tests and has compiled a number of recommendations.* [These, of]
course, must themselves be evaluated in terms of utility. At the very least,
they alert the investigator to some rules of evidence that are widely accepted
by psychologists and indicate procedures of which he might not have been
aware.

* An official statement of these recommendations is presented in “Technical
Recommendations for Psychological Tests and Diagnostic Techniques,”
Psychological Bulletin Supplement, LI, Pt. 2, 1-38 (1954). For a more detailed
discussion of various [aspects of] test construction, see L. J. Cronbach,
Essentials of Psychological Testing (New York: Harper and Row, 1960).




10.2 MEASUREMENT VALIDITY 
Validity and Measurement 

In scientific usage a measurement of a given phenomenon (as designated 
by a given concept) is viewed as a valid measure if it successfully “gets at” 
the phenomenon, i.e., it constitutes a true or correct measure. The concept
itself may or may not have a high degree of clarity. If it does, the investigator 
has less difficulty in locating the phenomenon to which the concept refers. 
He then makes the statement that his data indicate that the phenomenon he 
has observed constitutes a measure of the phenomenon to which the concept 
refers. For example, a respondent’s answer to a question concerning his age 
constitutes data that the investigator then utilizes as a measure of the phe- 
nomenon of chronological age. 

It should be noted that the investigator makes a statement that his
observations constitute a measure of the phenomena referred to by the concept.
Because it is a statement, it can be tested by obtaining evidence for or against it.
The abstract definition of the concept is not, in itself, a statement about the 
nature of reality, and consequently it may not be judged in terms of truth and 
falsity. On the other hand, the designation of certain data and of rules for 
putting them together as a measurement with respect to a concept does con- 
stitute a statement about the nature of reality. If the answer to a question 
on religion were taken as a measure of age, evidence could be presented to the 
effect that such data do not represent the phenomena referred to by the con- 
cept chronological age. Some of the procedures used to collect evidence as a 
measurement of validity are categorized under three major headings: face- 
validation procedures, criterion-validation procedures, and construct-validation 
procedures. 

Face-Validation Procedures 

The validation procedure that is least objective or perhaps most subject to 
error is face validation. This involves an analysis of whether the measurement 
appears to get at the concept “on the face of it.” Thus, for example, direct
measures tend to have more face validity than measures based on projective 
or other indirect techniques. 

Face-validation procedures are highly subjective in the sense that two re- 
searchers may easily come up with different evaluations. Perhaps even more 
serious is that surface similarities or differences may not be as important as 
less obvious ones. Thus, for example, the question “Are you maladjusted?” 
may, in spite of its face validity for measuring adjustment, constitute a poor 
measurement of this phenomenon, perhaps far less adequate than a Rorschach
Test with little face validity as a measure of adjustment. 


In his efforts to evaluate the face validity of a [given measure, it is often]
useful for the investigator to carefully analyze the meaning of [the concept]
and then trace the implications of this meaning for [...]. Such an analysis
can often reveal a good deal [about the nature of] the concept and lead to
ideas about other variables that might be related to the concept. The term
explication may be used to designate a [reinterpre]tation of a given concept
in this manner:

An explication of a given set of terms, then, combines essential aspects of
meaning analysis and of empirical analysis. Taking its departure from the
customary meanings of the terms, explication aims at reducing the limitations,
ambiguities, and inconsistencies of their ordinary usage by propounding a
reinterpretation intended to enhance the clarity and precision of their
meanings as well as their ability to function in hypotheses and theories with
explanatory and predictive force.*

Thus, what might begin as an exploration of face validity [may turn out],
through the process of explication, to be an improvement in
concep[tualiza]tion and lead to the emergence of new ideas for measurements
and hypotheses.

Perhaps the strongest argument that can be made for face-validation
procedures is that they are not very time-consuming. If the investigator is
[construct]ing an interview schedule containing a large number of items, he
would have to expend a considerable proportion of his limited research
[resources if] elaborate validation procedures were to be used on many of these
items. By limiting his attention to face validation, such resources might be
more fruitfully employed, for example, in providing a longer training period
for his [staff] of interviewers.


Criterion-Validation Procedures 

Let us now pursue the process of explication to its logical conclusion.
Suppose that such a reinterpretation of a given concept or a set of concepts is
effected and results in the development of a system of hypotheses or
propositions with the concept constituting an important element of this theory.
A next step would involve actually testing these hypotheses, and it is with
such tests that criterion-validation and construct-validation procedures are
concerned. They differ from face-validation procedures in that they are based
on objective testing, although the resulting inference with respect to validity
is still uncertain.

* Carl G. Hempel, Fundamentals of Concept Formation in Empirical Science
(Chicago: University of Chicago Press, 1952).

Criterion-validation procedures involve a new measure and a well-accepted
measure of a given concept. The well-accepted measure may be utilized as a
criterion to aid in the assessment of the new measure. For example, let us
assume that the structured-values question in the medical student study
(choices: indispensable in a job, extremely important, very important, fairly 
important, little or no importance) is the well-accepted measure of values 
serving as the criterion, whereas the less structured careers-value question 
constitutes the new measure being assessed. We may then proceed to investi- 
gate how close the relationship is between the criterion and the method under 
assessment, i.e., whether individuals' values as measured by the
structured-values question are very close to their values as measured by the
careers-value question.
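
How close the relationship is between a criterion and a new measure can be
summarized by a correlation coefficient. The sketch below uses the Pearson
product-moment coefficient, which is an assumption on our part (a rank-order
coefficient might suit ordinal ratings better), and the two lists of scores
are invented for illustration rather than drawn from the medical student
study.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson product-moment correlation between two equal-length score lists."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length lists with at least two scores")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined when a measure does not vary")
    return cov / sqrt(var_x * var_y)


# Hypothetical scores for six respondents on one value, coded 1 (little or
# no importance) to 5 (indispensable in a job).
criterion = [5, 4, 4, 2, 1, 3]      # structured-values question (criterion)
new_measure = [4, 5, 3, 2, 1, 3]    # careers-value question, coded the same way

if __name__ == "__main__":
    print(round(pearson_r(criterion, new_measure), 2))
```

A coefficient near 1 would count as evidence for the new measure only to the
extent that the criterion itself is accepted as valid.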

A number of questions may be raised with reference to criterion-validation 
procedures. One of them has to do with the procedures whereby the criterion 
measure itself was validated. If this validation was achieved with the aid of 
another criterion measure, then the question may be raised once more, and 
this may continue indefinitely. An additional question involves the utility of 
criterion-validation procedures in the first place. One may wonder why an 
investigator who already has a well-accepted measurement for a given concept 
should desire to create and attempt to validate some new measure. A third 
question involves the conservativeness of criterion-validation procedures. If 
present criterion measures are trivial and unfruitful and yet are used to judge 
new measures, then there is little hope for achieving a breakthrough in 
knowledge. 

Construct-Validation Procedures

The limitations of criterion-validation procedures have given impetus to a
concern with the total set of propositions in which a given concept is located,
and this has to do with construct validation. Suppose that we proceeded to
test not only the relationship between the structured-values question and the
careers-value question but also hypotheses about the relationship between
values and preference for medical fields. Within this context, values would
constitute an independent variable and preference a dependent one. We also
might choose to relate values to other dependent variables. It might be
convenient, in addition, to consider values as a dependent variable and test
hypotheses about the origin of values. If, in addition, other investigators had
done similar studies, all these data could then be utilized as evidence for or
against the validity of the careers-value question.

To illustrate the way in which such evidence might be used, suppose that
the hypothetical relationship between expected value deprivation and
preference for a medical field is supported directly or indirectly in the
literature. If this hypothesis is correct, then any valid method for measuring
values should also result in its confirmation. In general, to test a measure
with respect to a given concept for construct validity, we examine several
propositions that incorporate this concept and that have already been validated
to some extent. The measure in question is then utilized to test these
propositions once more and, if they are confirmed, we may infer that the
measure has [construct validity].

At this point we may examine whether or not such [construct valida]tion can
repair some of the deficiencies involved in cri[terion-validation pro]cedures.
One might ask how the propositions, in which the measure being tested is
inserted, were validated in the first place. If they [were validated on] the
basis of certain specific measures of each of the concepts [involved, then the]
general proposition is validated only to the extent [that each of these]
measures is valid. But here we are back again to the problem of validating
measures for given concepts, except that instead of having only the original
[concept or] concepts, we now have additional ones to consider.

Thus far, it would seem that neither criterion-validation nor
construct-validation procedures offer a means for definitely validating [a
given measure]ment. It is also apparent that propositions that incorporate
[concepts] cannot be definitely validated, as such validation involves
measurements [that cannot] themselves be definitely validated. The scientist,
however, [accepts] this lack of definitive verification and settles, instead,
for evidence [that is not] definitive but is nevertheless valuable. As this
applies to [criterion] validation and construct validation, each time that a
relationship that [would indicate] the validity of a given measure for a
concept is tested and confirmed, additional evidence for its validity is
produced. We also may obtain some [degree] of evidence for validity whenever a
given concept is explicated.

The second question raised with respect to the adequacy of criterion-validation
procedures, and which also might be posed for construct-validation procedures,
concerns their utility. The investigator never possesses a perfectly valid
measure of any given concept. He might do well to become dissatisfied with
well-accepted measures because they may not be as fruitful for the development
of theory as new measures are. Each time some evidence is obtained as to the
validity of a given measure for a concept through the [confirmation] of a
proposition, evidence is also obtained as to the validity of the proposition
as well as any theory within which it is implicated. Construct-validation
procedures have an advantage over criterion-validation procedures because
[they] alert the investigator to an entire system of propositions, rather than
[only] the relationship between the new measure and the criterion measure.
Construct-validation procedures thus have to do with the same kind of
orientation to the research process as does a concern with theory, and the
development of theory that can explain and predict effectively may be an end
product of both.

10.3 RELIABILITY AND PRECISION

Reliability 

If a spring scale is utilized to measure the weight of an individual, and if
the indicator points to the same number of pounds when the individual steps
on and off the scale several times, then we have obtained evidence that the
scale is a reliable measuring instrument. If, on the other hand, the indicator
points to different numbers each time, then we have evidence for its
unreliability. More generally, a measuring instrument is reliable if it yields
the same result in repeated applications to the same phenomena. If the
phenomena change (e.g., if an individual becomes heavier between weighings), we
would naturally expect the result obtained from the measuring instrument to
change accordingly. The importance of reliability stems from the fact that it
gives the investigator confidence that variations in his data are not the
result of imperfections in the measuring instrument itself.

The importance of the concept of reliability also stems from the ease with
which reliability may be assessed relative to validity. In determining
reliability, we are not dealing with the entire system of propositions within
which a given concept is involved but, rather, with the behavior of a given
measuring instrument under different conditions. Although it is easier to
assess the reliability of a measuring instrument, the establishment of its
validity is generally more valuable. For example, a spring scale constitutes a
perfectly reliable measuring instrument if it indicates exactly 2 pounds each
time an individual stands on it, for it is then yielding the same result in
repeated applications to the same phenomena. But there would be good reason to
believe that such measurements did not constitute valid or correct indications
of the true weight of the individual. Thus, a measurement of a given concept
may be quite reliable and yet be quite invalid. The reverse does not hold,
however, because a measuring instrument that repeatedly provided a valid index
for a given concept would have to be quite reliable as well.
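
The spring-scale example can be put in numerical form. In the sketch below,
the spread of repeated readings stands in for reliability and the gap between
their average and the true weight stands in for validity. Both summary
statistics, the true weight of 150 pounds, and the sets of readings are
assumptions chosen for illustration.

```python
from statistics import mean, pstdev


def describe_readings(readings, true_value):
    """Return (spread, bias) for repeated readings of one unchanged object.

    spread: standard deviation of the readings; 0.0 means the instrument
            gave the same result every time (perfectly reliable).
    bias:   mean reading minus the true value; 0.0 means the readings are
            correct on the average.
    """
    spread = pstdev(readings)
    bias = mean(readings) - true_value
    return spread, bias


true_weight = 150.0                          # hypothetical true weight, pounds
stuck_scale = [2.0, 2.0, 2.0, 2.0]           # always indicates exactly 2 pounds
noisy_scale = [148.0, 153.0, 149.0, 150.0]   # varies from weighing to weighing

if __name__ == "__main__":
    print("stuck scale:", describe_readings(stuck_scale, true_weight))
    print("noisy scale:", describe_readings(noisy_scale, true_weight))
```

The stuck scale shows no spread at all and yet is far from the true weight,
which is the sense in which a measure may be quite reliable and quite invalid.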

As with data-collection methods in general, the investigator can view the 
situation wherein reliability is being tested not only as a means to aid the 
achievement of other ends but also as an end in itself. If it is true that a given 
process of measurement produces a reliable measure, this can aid the re- 
searcher to develop and test propositions that explain why the measurement 
situation does in fact yield a reliable measure. If, for example, a reliable
measure for values can be obtained, then certain factors within the measurement
situations were working to produce the same result. If these factors can be 
identified and their relationships to the resulting data tested, a contribution 
may be made to our knowledge of behavioral processes. Viewing an under- 
standing of the measurement situation as an end in itself can also lead to 
important knowledge even in cases where a given measuring instrument proves
to be unreliable. In such cases the investigation centers around the factors
that produced differing results in the two measurement situations.

Thus far reliability has been discussed with reference to the stability of 
measurement — i.e., a consideration of variations from one time interval to 
another. In the example of the spring scale, we evaluated reliability by determining
whether or not the same result was obtained upon successive measurements.
Another way of examining the reliability of the scale is to construct
another equivalent scale. If measures obtained by [...] procedures [...],
then we have evidence for the reliability of the [...]. [...]
for assessing reliability focus on stability, others on equivalence, [...]
on a combination of the two.⁷

Whereas techniques for assessing reliability are helpful [...] the various
data-collection methods, these techniques have [...] the
context of questionnaire and interview techniques. In a procedure to assess
stability, the respondent may be interviewed several times in what is called
the test-retest procedure, and a measure of the degree of agreement between the
test and the retest may be obtained. Of course, there exists the possibility that the
initial test will affect the subsequent test, so that a change that occurs may be
the product not of the measuring instrument itself but of [...] application.
There is also the possibility that factors [...] the measuring
instrument itself have resulted in an actual change in the phenomena being
measured. The general procedures for dealing with such possibilities are
similar to those utilized in the experiment and may involve, for example, the
introduction of a control group. A less elaborate procedure is to separate the
test from the retest by a period long enough to reduce substantially or eliminate
the effect of the initial test and yet not so long as to change the phenomena
being measured to any significant degree.

One widespread approach to the assessment of reliability by use of equivalent
questions is the split-half method. If we assume that the [...] is
obtained by putting together information from a number of different items,
then a larger number of items may be used than might ordinarily be necessary
and the items may be divided into two halves after the data are
collected.⁸ The two halves are then compared to test for [...]
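
A split-half comparison may be sketched as follows. The odd-even division of the items, the use of the product-moment correlation as the index of agreement, the function names, and the response data are all assumptions introduced for illustration; the text specifies only that the items are divided into two halves and that the halves are compared.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Product-moment correlation between two equally long lists of numbers."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

def split_half_r(item_scores):
    """Correlate each respondent's total on one half of the items with the other half.

    item_scores holds one list of item scores per respondent. The halves are
    formed from alternating items, which is only one of many possible divisions.
    """
    first_half = [sum(row[0::2]) for row in item_scores]
    second_half = [sum(row[1::2]) for row in item_scores]
    return pearson_r(first_half, second_half)

# Hypothetical responses of five individuals to four items scored 1 to 5.
responses = [
    [4, 5, 4, 5],
    [2, 1, 2, 2],
    [3, 3, 4, 3],
    [5, 4, 5, 5],
    [1, 2, 1, 1],
]
r = split_half_r(responses)
print(round(r, 2))
```

A coefficient near 1.0 would be taken as evidence that the two halves behave as equivalent measures; a low coefficient would direct attention to the items themselves.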

Although it may be valuable to test the reliability of a given measuring
instrument before using it in investigations, is it useful for the investigator to
follow this rule invariably? A similar question might be posed with respect to
the validation of a given measurement:

Unless satisfactory validity or reliability has already been demonstrated,
the reliability of a measuring instrument should be determined before it is
used in a study, rather than after. If a research instrument is plagued by
variable error, the likelihood of achieving significant results is minimized.
Rather than go ahead with unreliable instruments, it may be prudent to
delay the research and try to increase their reliability.⁹

⁷ The distinction between stability and equivalence has been developed by L. J. Cronbach
in "Coefficient Alpha and the Internal Structure of Tests," Psychometrika, 16:
297-334 (1951).

⁸ For a discussion of the various procedures that have been developed to [...]
approach, see H. Gulliksen, Theory of Mental Tests (New York: John Wiley and Sons,
1950).

⁹ Claire Selltiz, et al., Research Methods in Social Relations (New York: Holt, Rinehart
and Winston, 1963, p. 167).




The advice implies that research should proceed in stages, and that the 
development of valid and reliable measurement should usually precede the 
investigation of phenomena. But research actually seems to proceed in a back- 
and-forth fashion rather than in stages. Imperfect and even very rough meas- 
urements may not be easily improved until they are actually utilized in the 
testing of hypotheses. It may often be preferable to proceed with rough meas- 
ures, because the wait for measures may be a long one — especially if the 
insight derived from actual research with the measures is necessary for their 
improvement.

One indirect but effective approach that may be used to increase the reli- 
ability of a given measurement involves the various validation techniques. The 
fault may lie not only in the measure but in the concept as well. For example, 
the concept may not be a unitary one, but may include different types of
phenomena. Explication of the concept, as well as research which explored the
construct validity of the measure, might lead to the definition of both a more
unitary and improved concept. Another approach, already discussed,
involves an analysis of the measurement situation itself with a view to
determining what factors produced the unreliability.

An additional approach, which has been developed within the context of
psychological testing, calls for adding measurement operations (for example,
items in a questionnaire) of the same type as the ones used initially and utiliz- 
ing the total set of items, rather than the initial set alone, to arrive at the score 
of a given individual. Under this procedure, very high reliabilities may readily 
be obtained. The rub here is that, the greater the number of items included 
within a given measurement, the more difficult it becomes to know what the 
total score is getting at — i.e., what concept or concepts it is measuring. 
Within a research context it is generally useful to avoid measures that apply 
to several different factors or dimensions. Thus, although an additional set of
items may increase reliability, it may also result in a measurement that is too 
heterogeneous to be fruitful for the development of theory. 
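
One conventional way of expressing the gain in reliability from added items is the Spearman-Brown formula of psychological testing. It is not named in the text and is offered here only as a sketch. It assumes that the added items are equivalent to the original ones, an assumption that the warning above about heterogeneous measurements calls into question. The function name and the illustrative values are hypothetical.

```python
def spearman_brown(r_original, length_factor):
    """Projected reliability when a measure is lengthened by length_factor.

    Uses the Spearman-Brown formula r_k = k*r / (1 + (k - 1)*r), which
    assumes that the added items are equivalent to the original items.
    """
    k = length_factor
    return (k * r_original) / (1 + (k - 1) * r_original)

# A hypothetical measure with reliability 0.50, doubled and quadrupled in length.
print(spearman_brown(0.5, 2))
print(spearman_brown(0.5, 4))
```

The projected coefficient rises with each increase in length, which is the sense in which very high reliabilities may be obtained by this route; nothing in the formula indicates whether the lengthened measure still refers to a single concept.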

Precision 

A given spring scale may be reliable to the pound but not to the ounce. 
This would occur, for example, if the weight of a given individual varied
between 167 pounds, 2 ounces, and 167 pounds, 5 ounces, on the scale in 
question. Reliability, then, should be understood within the context of a given 
degree of precision, and the greater the degree of precision desired the more 
difficult it is to obtain the appropriate degree of reliability. The precision of 
the measuring instrument is here understood to vary inversely with the width
of the smallest interval it can report; very precise scales may provide measures 
to the nearest tenth of an ounce, whereas less precise ones may measure to 
the nearest pound. 
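
The spring-scale example may be restated as a short sketch. The function names are hypothetical, and treating a measure as reliable at a given precision when all readings round to the same reported value is a simplifying assumption adopted only for illustration.

```python
OUNCES_PER_POUND = 16

def to_ounces(pounds, ounces):
    """Express a weight given in pounds and ounces as a number of ounces."""
    return pounds * OUNCES_PER_POUND + ounces

def reported_value(weight_oz, interval_oz):
    """Round a weight to the nearest multiple of the smallest reportable interval."""
    return round(weight_oz / interval_oz) * interval_oz

def is_reliable_at(readings_oz, interval_oz):
    """True if every reading is reported as the same value at this precision."""
    return len({reported_value(r, interval_oz) for r in readings_oz}) == 1

# The two readings from the example: 167 lb. 2 oz. and 167 lb. 5 oz.
readings = [to_ounces(167, 2), to_ounces(167, 5)]
print(is_reliable_at(readings, OUNCES_PER_POUND))  # reported to the nearest pound
print(is_reliable_at(readings, 1))                 # reported to the nearest ounce
```

The same pair of readings agrees when reported to the nearest pound and disagrees when reported to the nearest ounce.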

It is relatively easy to obtain a high degree of reliability when little precision
is needed. Thus, for example, it would be easier to [...]
measurement of a given value if only two degrees of [...]
(for example, high and low) than if five degrees were desired.

As a science progresses, the precision of measurement [...]
and the scientist is able to develop more definitive tests of propositions,
particularly those specifying degrees and types of relationships. Thus, for example, a
distinct advance is represented by moving from a description [...]
in terms of hot and cold to a description in terms of the expansion of a column
of mercury. When we deal with the mercury thermometer, we are [...]
a high degree of reliability for a high degree of precision. It is useless
to create multiple categories that give the appearance of a high
degree of precision unless reliability is sufficiently high. In fact, it is not only
useless, but misleading as well. Within the physical sciences it is a general
rule that the investigator should not include so many significant figures in a
given measurement that a misleading portrayal of the reliability involved is
implied. An ammeter dial fluctuating between 25 and 27 amperes should not
be read as 26.000 amperes but rather as 26 amperes or 26 ± 1 amperes.
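
The ammeter rule may be sketched as follows. The function name is hypothetical, and the rule used for choosing the number of decimal places is a crude assumption, not a standard of the physical sciences.

```python
def report_reading(low, high):
    """Report a fluctuating dial reading as midpoint plus or minus half the range."""
    midpoint = (low + high) / 2
    half_range = (high - low) / 2
    # Keep only as many decimal places as the fluctuation justifies (rough rule).
    decimals = 0 if half_range >= 1 else 1
    return f"{midpoint:.{decimals}f} ± {half_range:.{decimals}f} amperes"

print(report_reading(25, 27))
```

For the dial described above, the sketch reports 26 ± 1 amperes and not 26.000 amperes.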


10.4 SCALES OF MEASUREMENT 
The Nature of Scaling 

Just as there are procedures for improving the reliability of a measurement,
so also are there techniques for increasing its precision. A thermometer without
any calibrations whatsoever may constitute a highly reliable measuring
device, but the extent of this reliability could not be fully revealed until the
thermometer were calibrated. Once this is done, the thermometer can be
used to determine temperature under a wide variety of conditions. Such
measurements may then be of great value in the development and testing of
scientific theories.

Scaling procedures may be viewed as techniques for producing such things
as a calibrated thermometer. If the thermometer works effectively, the liquid
inside it will not expand in an erratic fashion. The same holds true for effective
scaling techniques. The behavior of the liquid within the thermometer may
itself be the focus of investigation, but thermometers are usually conceived
within a research context as providing means for the investigation of other
phenomena. Similarly, an analysis may be made of the process that leads to
some aspects of human behavior forming a precise scale, and this may be of
interest in its own right. Research procedures in general, however, emphasize
an exploration and testing of the various relationships of scaled behavior to
other behavior. Furthermore, a given scaling technique may produce the kinds
of measurements that may be reliable and precise, and yet such measurements
may be useless if they have very few relationships with other phenomena.
On the other hand, a very rough categorization may be extremely
valuable for the progress of knowledge if the concept involved has systematic
import.

Scaling technique is a procedure for the assignment of numbers (or other 
symbols) to a property of objects in order to impart some of the characteris- 
tics of numbers to the property in question. The calibrations on a thermometer 
indicate numbered degrees (whether Fahrenheit, Centigrade, or Kelvin). 
When the temperature of a given entity is measured, a conclusion is reached 
as to the assignment of a given number of degrees to the temperature (a 
property) of that entity. Because of the effectiveness of the thermometer as
a scaling instrument, both the order characteristics of numbers (2 is larger
than 1, 3 is larger than 2, and so on) and such characteristics as the additivity
of units (1 + 1 = 2, 2 + 1 = 3, and so on) are imparted to temperature (the
property being measured).

Is it necessary to impart the characteristics of numbers to a given property? 
Is it not true that the phenomenon already has these properties? The answer 
is that phenomena are organized by human beings. Thus, it is not the phe- 
nomena that have the characteristics of numbers ready-made but the human 
beings who measure properties of phenomena in such a way that the char- 
acteristics of numbers may be imparted to these properties. In each case, it is 
a property that receives these characteristics (e.g., quantity, length, weight,
temperature, and so on). In each case, it is a specific human invention that
makes it possible to impart some of the characteristics of numbers to the 
properties of objects. Reliable and precise spring scales may be used to meas- 
ure weight, for example, but the spring scale had to be invented first. 

Once such a device is invented, there is no guarantee that it will actually 
impart some of the characteristics of numbers to the properties being meas- 
ured. If, for example, the spring scale pointer or the fluid inside the ther- 
mometer varied erratically, we would not be able to measure the weight or 
temperature of entities in such a way that each property could be ordered or 
added. 

If liquids are being combined, for example, the property or variable of 
volume does not necessarily have the characteristics of additivity. Two liquids 
may chemically react with one another and the volume of the two together 
may differ markedly from the arithmetic sum of the initial volume of each. 

An example, which is generally of greater interest to the behavioral scientist,
has to do with the measurement of values. In measuring monetary goals,
for example, although we can add the quantity of money, such additions do
not generally present accurate increments of value to the individual. To a
very wealthy man, an increment of $5000 over his present yearly income may
not be very important, whereas such an increment for an individual with a 
salary of $4000 per year would almost invariably constitute a goal of great 
value. Behavioral scientists are presently engaged in attempting to measure
values in such a way that the characteristic of additivity is [...] (see Chapter 12).
If they are successful, it will be possible to develop [...]
propositions that incorporate the concept "value."

Scaling Procedures and Strategies 

Once it is understood that the characteristics of numbers are not automatically
inherent in phenomena, and once the importance of imparting these
characteristics is understood, we may next inquire as to procedures for
accomplishing this. Scaling procedures, as has been [...], [...]
imparting such characteristics as order and additivity to various properties of
phenomena.

The number system we use was developed in such a way that the integers
would have such characteristics as order and additivity. The number [...] is
built around a set of axioms from which theorems are derived. The same is
true of plane geometry. Neither number theory nor plane geometry
can be effectively applied to nature unless the axioms hold true for the natural
phenomena. Thus, plane geometry "works" on a plane surface (that is, propositions
such as the Pythagorean theorem predict quite accurately) because the
axioms of plane geometry seem to be true for a plane surface. The axioms of
plane geometry, however, do not hold true for the surface of a sphere. In the
same way, if the axioms of number theory do not hold for a given property of
phenomena, we cannot expect to be able to apply its theorems to the property
in question. On the other hand, if the axioms are true for the given property,
then we can make use of the theorems.

It seems then that in order to impart various characteristics of numbers to
a given property of phenomena, it is necessary that the axioms of number
theory hold true for the property in question. This is not an all-or-none affair,
however. Just as no proposition about nature can be validated with absolute
certainty, so it is true that the validity of the axioms of number theory cannot
be proved with certainty. Scaling procedures vary in the degree of attention
given to testing these axioms, and this variation reflects different strategies of
measurement. The differences in measurement strategy are similar to the
different approaches to testing reliability, validity, or scale properties of the
measure before using it in testing propositions.
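
What testing an axiom may involve can be sketched for one order property, transitivity. The pairwise judgments, the labels, and the function name are hypothetical; transitivity is only one of the properties that would have to hold before the order characteristics of numbers could be imparted to a set of judgments.

```python
from itertools import permutations

def is_transitive(ranks_above):
    """Check whether 'a above b' and 'b above c' always imply 'a above c'.

    ranks_above is a set of (a, b) pairs, each meaning that a was judged
    to rank above b.
    """
    items = {x for pair in ranks_above for x in pair}
    for a, b, c in permutations(items, 3):
        if (a, b) in ranks_above and (b, c) in ranks_above \
                and (a, c) not in ranks_above:
            return False
    return True

# Hypothetical pairwise judgments among three goals A, B, and C.
consistent = {("A", "B"), ("B", "C"), ("A", "C")}
cyclic = {("A", "B"), ("B", "C"), ("C", "A")}

print(is_transitive(consistent))
print(is_transitive(cyclic))
```

The first set of judgments can be represented by ordered numbers; the second cannot, because no assignment of numbers places A above B, B above C, and C above A.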

Each approach involves a possible gain and a possible loss. By choosing
to perfect instruments for measurement before using them, the results of
investigation may be more impressive within the context of justification. The
possible loss here is the delay involved in perfecting the instruments and, in
a more extreme version of this strategy, the neglect of the kinds of phenomena
for which it is difficult to develop reliable, precise, and apparently valid
measurements. If he proceeds before the instruments have been perfected,
the investigator may get erroneous findings, but the possible gain is an immediate
increase in his knowledge and the opportunity to utilize this knowledge
to improve the instruments. Between these two choices, of course, there
exists a whole range of alternatives. Whichever choice is made by the investigator,
he would do well to take into account the possible loss involved and
attempt to compensate for it as best he can. Thus, if he proceeds to use a
given measurement of unknown validity, reliability, and precision, he might
give priority to the further development of the measurement on the basis of
his research. On the other hand, if he proceeds to develop the validity, reliability,
and precision of a given measurement, he might make certain that the
measurement is actually used in the development and testing of propositions
as soon as possible.

SELECTED REFERENCES FOR FURTHER STUDY 

Lee J. Cronbach and Paul E. Meehl, "Construct Validity in Psychological Tests,"
Psychological Bulletin, LII: 281-302 (1955). In addition to a detailed presentation
of the rationale for construct-validation procedures, the article includes
a discussion of other approaches to validation as well as a comprehensive 
bibliography. Validation procedures are seen within the context of ideas from 
the philosophy of science. 

Delbert C. Miller, Handbook of Research Design and Social Measurement 
(New York: David McKay Company, 1964). Most of this volume consists of
a large number of measures or indices that have been developed for each of the
following areas: social status, group structure and dynamics, morale and job
satisfaction, the community, social participation, leadership in the work organi- 
zation, attitudes, and the family. Other sections include selections from a number 
of social scientists on methods and statistics. 

Samuel A. Stouffer, et al., Measurement and Prediction (Princeton, N.J.:
Princeton University Press, 1950). This is still the most comprehensive treatment
available of the scale-analysis technique and the rationale behind it. In
addition, there are several chapters on latent structure analysis. Some parts of
the book require sophistication in mathematics but most of it does not.





Nominal and Ordinal Scales 


11.1 THE DISCOVERY OF CONCEPTS
Factor Analysis 

Little is scientifically established about the process of effective conceptualization,
although it seems likely that our knowledge of it will increase. One
reconstructed logic of discovery that has been put forward is called factor
analysis. This technique has been widely used in the fields of psychology and
education, especially in the testing or measurement of various types of abilities
and aptitudes. The theory on which this technique is based may best be stated
within the context of such tests, although the technique itself is quite general
and may be applied to many types of data. Fruchter states this theory as
follows:

A basic assumption of factor analysis is that a battery of intercorrelated
variables has common factors running through it and that the scores of an
individual can be represented more economically in terms of these reference
factors. An individual's score on a test is dependent upon two things:
the particular abilities assessed by the test, and the amount of each of these
abilities possessed by the examinee.¹

Factor analysis is based on measures of association, the Pearsonian product-moment
correlation coefficient (henceforth referred to as the correlation coefficient)
being most frequently used.² We may view the correlation between
two given variables to be a measure of the degree to which they are associated
with one another, in the sense of Mill's method of concomitant variations. Thus,
a highly positive correlation coefficient (the maximum is 1.0) would be produced
if, in relating income to age, income were found to increase with
increases in age. A correlation near 0 would indicate that no relationship between
income and age existed. A highly negative correlation (the minimum is
-1.0) would indicate that income decreases with age. In the
above-quoted passage, "a battery of intercorrelated variables" means that
each variable is correlated with every other variable. Measures of association
are discussed in Section 15.3.

¹ Benjamin Fruchter, Introduction to Factor Analysis (Princeton, N.J.: D. Van Nostrand
Co., 1954, p. 44).

² Whereas some investigators feel that Pearsonian coefficients are the only suitable
ones on which to perform a factor analysis, others have obtained interesting results using
different measures of association.
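
The three cases just described may be illustrated with a short sketch of the product-moment correlation coefficient. The ages and incomes are invented for the purpose, and the function name is hypothetical.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Product-moment correlation coefficient for two equally long lists."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

# Hypothetical ages and yearly incomes (in hundreds of dollars) for five persons.
ages = [25, 35, 45, 55, 65]
income_rising = [30, 40, 50, 60, 70]     # income increases with age
income_falling = [70, 60, 50, 40, 30]    # income decreases with age
income_unrelated = [50, 40, 60, 40, 50]  # no linear relation to age

r_up = pearson_r(ages, income_rising)
r_down = pearson_r(ages, income_falling)
r_none = pearson_r(ages, income_unrelated)
print(r_up, r_down, r_none)
```

The first set of incomes yields the maximum coefficient, the second the minimum, and the third a coefficient of zero.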

By means of rather involved mathematical techniques, the set of intercorrelations
is manipulated so as to produce a number of "common factors,"
usually fewer than the number of tests or variables involved in the intercorrelations.
Within the context of testing, each factor represents a particular ability
assessed by the set of tests. Each test is viewed as assessing this ability to a greater
or lesser degree, and the index of this degree is called the factor loading of the
test. The tests vary in their factor loadings on a given factor and in the number
of factors on which they are loaded. Thus, for example, a test might
have the maximum loading of 1.0 on a given factor and loadings of 0.0 on
the remaining factors. In this instance, the test provides the best possible
measure of the factor in question, but constitutes a very poor measure
of the other factors. Another test might have a factor loading of [...] on
several factors and 0.0 on the remaining ones. This indicates that it
provides a fairly good measure of some factors and a very poor measure of
the remaining ones.

An individual's score ($s$) on a given test is equal to the sum of various
products: the test's various factor loadings ($a_1, a_2, a_3, a_4, \ldots$) and the
amount of each of these abilities he possesses ($x_1, x_2, x_3, x_4, \ldots$).
Mathematically, the equation is:³

$$ s = a_1 x_1 + a_2 x_2 + a_3 x_3 + a_4 x_4 + \cdots $$
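
The equation may be restated as a short computation. The loadings and ability amounts are hypothetical, and the function name is introduced only for this sketch.

```python
def predicted_score(loadings, abilities):
    """Sum of products of a test's factor loadings and a person's ability amounts."""
    if len(loadings) != len(abilities):
        raise ValueError("one ability amount is needed for each factor loading")
    return sum(a * x for a, x in zip(loadings, abilities))

# Hypothetical amounts of three abilities possessed by one individual.
abilities = [1.5, -0.5, 2.0]

# A test loaded 1.0 on the first factor and 0.0 on the others.
pure_test = [1.0, 0.0, 0.0]
# A test loaded on the first two factors and not at all on the third.
mixed_test = [0.5, 0.5, 0.0]

print(predicted_score(pure_test, abilities))
print(predicted_score(mixed_test, abilities))
```

The first test reproduces the individual's standing on the first ability alone, whereas the second blends his standing on two abilities into a single score.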

Like other techniques that may aid in the measurement process, factor
analysis may be evaluated both in terms of the characteristics of the measures
it is designed to achieve and in terms of the assumptions or theory [...]
as to the nature of reality. It provides an objective procedure for developing
concepts that may then be scaled, one of the few procedures [...] to
aid the investigator in formulating concepts. Historically, factor analysis has
revolutionized the field of psychological testing by empirically examining the
interrelationships or correlations among the items used for a given test. Tests
once thought to be made up of relatively homogeneous items with respect to
the ability being tested have been found to constitute measures of a number
of different factors. This has resulted in the reconceptualization of many tests,
the reformulation of the items incorporated in various tests, and the development
of new concepts.


³ This equation [...] independent (orthogonal) factors, where s represents the standard
score of the given individual on the given [...] in standard scores.


The possibilities of factor analysis in the context of discovery may be illustrated
by the conceptualization of values in a recent study of value statements.⁴

Value statements were collected from tests of values, personality tests,
theoretical treatments of values, and statements made by subjects in group
discussions. The resulting pool of 872 items was editorially reduced to 143,
with the aim of retaining representation of all value positions. Factor analysis
of the 143 items yielded . . . Factor I, Acceptance of authority (best
item: "Obedience and respect for authority are the most important
virtues children should learn"); Factor II, Need-determined expression vs.
value-determined restraint ("Since there are no values that can be eternal,
the only real values are those which meet the needs of the given moment");
Factor III, Equalitarianism ("Everyone should have an equal chance and an
equal say"); and Factor IV, Individualism ("It is the man who stands alone
who excites our admiration").⁵

All those items that had high loadings on Factor IV were examined in order
to conceptualize this factor, e.g., to find a name for and a definition or description
of it. These items included the following: To be superior a man
must stand alone. It is the man who stands alone who excites our admiration.
The rich internal world of ideals, of sensitive feelings, of reverie, of self-knowledge,
is man's true home. Whoever would be a man, must be a nonconformist.
The individualist is the man who is most likely to discover the
best road to a new future. One thing all these items seem to have in common
may be called individualism, and in this way a concept is formulated.

Factor analysis, then, provides procedures for the development of concepts,
which then may be scaled.⁶ Such conceptualization usually takes place before
the construction of a particular type of scale, and factor analysis constitutes
one of the few empirical procedures for the discovery of concepts. The procedures
generally used by investigators are not so elaborate, and the different
logics-in-use vary a good deal. Little research has been done on the problem
of formulating effective concepts, and thus there are few reconstructed logics
(such as factor analysis) available for this purpose. Some investigators, guided
by whatever theory they think important, proceed from theory to concepts.
Others prefer to immerse themselves in the situation under investigation,
and on the basis of some process about which very little is known proceed
to construct concepts.

⁴ Robert F. Bales and Arthur S. Couch, "The Value Profile: A Factor Analytic Study
of Value Statements" (mimeo), Harvard University Department of Social Relations.

⁵ Ibid., p. 1.

⁶ It is also possible to use factor analysis procedures to place individuals on scales once
these concepts have been developed. Each individual receives a score on a given factor
which is the product of the loadings of each item or test on this factor and the individual's
performance or scores on these items.

Let us now examine the assumptions or theory of factor analysis as to the
nature of reality. In particular, how valid is the theory in which the score
of a given individual is seen as a function of the abilities [...]
and the amount of each of these abilities possessed by the individual? No
definite answer is yet available.

The particular variables incorporated in such a theory would play a large
part in determining its effectiveness. The factors produced by [...] analysis
can be no better than the items on which the analysis is based. [...]
may yield factors that are quite trivial for behavioral science, or [...]
unearth factors that, when measured, lead to the development of an
important theory.

Alternative theories about test performance might also be put forward. For
example, a good deal of social science theory suggests [...]
personality as dynamic, with emphasis on the fluidity rather than [...]
of traits or abilities. Such theory is often tied to an emphasis on the situation
within which the individual finds himself, including his own definition
or conception of it, rather than an exclusive focus on the test instrument or
the research situation. Such theories, to the extent that they are correct,
suggest the need for changes in the assumptions on which the procedures
of factor analysis are based and, consequently, the procedures as well.


Q-Technique 

One critique of the traditional approach to factor analysis or R-technique
has been put forward by Stephenson:

In the past, although it has been little recognized, the concrete study of
behavior has depended very largely, in its systematic respects, upon assumptions
about individual differences and upon the use of large numbers
of "cases" as a basis for scientific operations and generalizations. The brilliant
and penetrating analyses of famous novelists have been rejected from
all scientific concern, because they deal, presumably, with particular [...]
or events. ... We prefer the penetrations of the humanists [to the statistical
techniques of factor analysis] and would like to bring them to heel, like the
good hunters they have been, for our scientific purposes.⁷


Stephenson has developed what he calls Q-technique, which involves methods
similar to those of factor analysis but which are applied to a single individual
in order to discover the various factors that explain his behavior. This
technique implies that the search for factors on the basis of data from large
numbers of individuals tends to obscure an understanding of the dynamics of
behavior as it relates to any particular individual. Stephenson's concern "is
with far more than the simple operations called Q-technique; rather, it is with
a comprehensive approach to the study of behavior, where man is at issue as
a total thinking and behaving being."⁸

⁷ William Stephenson, The Study of Behavior: Q-Technique and Its Methodology
(Chicago: University of Chicago Press, 1953, p. 4).

One example of Q-technique, which Stephenson provides, involves a series
of administrations of the Szondi Test⁹ to a given individual. This projective
test centers on the reactions of the subject to a series of 48 photographs of
former mental patients, 8 each for sadists, hysterics, catatonic schizophrenics,
paranoids, depressives, and manics. In the first administration of the test,
the subject may be asked to score each photograph, along a range of 0-8,
according to how much he likes the individuals pictured. In the second
administration, the subject may be asked to rate the photographs with respect
to how God-fearing the individuals appear to be. Each time the subject rates
the 48 photographs he is required to categorize a specific number of photographs
within each of the possible scores from 0-8:


                 Most Liked  . . .  Least Liked
Score            8    7    6    5    4    3    2    1    0
Frequency        2    4    5    8   10    8    5    4    2    (n = 48)


In this manner, two sets of scores may be obtained for the 48 photographs.
Additional sets may be obtained through the introduction of other characteristics
(such as handsomeness, health, and age) in subsequent administrations
of the test. The scores on each administration of the test are then correlated
with the scores received on other administrations of the test. In other words,
instead of using the scores of a number of individuals on a series of items,
Q-technique utilizes the scores of one individual on a series of items having
to do with a number of different "conditions of instruction." Thus, the scores
under each different condition of instruction in Q-technique substitute for the
scores of different individuals in traditional factor analysis. Consequently, a
set of correlations of the scores between each administration and every other
administration may be produced, and common factors may be extracted in
the same way as they are in traditional factor analysis. These factors, however,
refer to aspects of a given individual as manifested over a sequence of
different conditions of instruction; they do not refer to underlying properties
which a number of items may have in common.
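
The first steps of this procedure, checking that each administration follows the required frequencies and correlating one administration with another, may be sketched as follows. The two sets of scores are invented for illustration and the function names are hypothetical; the extraction of factors from the resulting correlations is not shown.

```python
from collections import Counter
from math import sqrt

# Required number of photographs at each score, from 8 (most) to 0 (least).
FORCED_FREQUENCIES = {8: 2, 7: 4, 6: 5, 5: 8, 4: 10, 3: 8, 2: 5, 1: 4, 0: 2}

def follows_forced_distribution(scores):
    """True if the 48 scores use each value exactly the required number of times."""
    return dict(Counter(scores)) == FORCED_FREQUENCIES

def pearson_r(xs, ys):
    """Product-moment correlation coefficient for two equally long lists."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

# Hypothetical first administration: photographs 0-47 scored for liking.
liking = [score for score, count in FORCED_FREQUENCIES.items() for _ in range(count)]
# Hypothetical second administration in which the ordering is exactly reversed.
god_fearing = liking[::-1]

print(follows_forced_distribution(liking), follows_forced_distribution(god_fearing))
print(pearson_r(liking, god_fearing))
```

With a larger number of conditions of instruction, the same computation applied to every pair of administrations would yield the set of correlations from which factors are extracted.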

Q-technique is subject to the same kinds of assumptions or theory as traditional
factor analysis, e.g., that a given score is a function of a sum of products.
The derivation of factors from the scores of a number of different
individuals, however, differs considerably from their derivation from the
scores of one individual over a number of different occasions. The latter procedure
provides a closer view of human behavior as a product of the
individual's own definition of the situation. Like R-technique, Q-technique provides
a reconstructed logic for the discovery of concepts, and these concepts may
subsequently be scaled in various ways.

⁸ Ibid., p. 7.

⁹ S. Deri, Introduction to the Szondi Test (New York: Grune and Stratton, 1949).

In the field of psychology there is some controversy over [...] Q-technique.
It has been classified by some individuals as merely [...]
traditional factor analysis. Stephenson, however, maintains that this is not
true, and that the differences between the two methods imply important differences
in psychological, logical, and methodological principles.


Latent Structure Analysis 

Another procedure, similar to factor analysis and Stephenson's Q-technique
in that it provides a reconstructed logic for the discovery of concepts and consequently
may lead to the development of scales of various types, is latent
structure analysis.

The basic postulate is that there exists a set of latent classes, such that
the manifest relationship between any two or more items on a [...]
can be accounted for by the existence of these latent classes [...]
alone. This implies that any item has two components: one which is associated
with latent classes and one which is specific to the item.¹⁰

This method has not yet come into widespread use, so it is not yet possible
to evaluate its utility. One feature of latent structure analysis is its use of
probability theory as a means of inferring the relationships between "manifest"
data (that is, the actual responses of individuals) and "latent" categories.
According to Lazarsfeld:

The model of latent structure analysis permits us to be more precise about
how these inferences are made; it brings out the assumptions implied in
various measurement procedures. In developing the model, we shall find it
useful to incorporate the notion of probability mechanisms into a more
elaborate scheme.


Latent structure analysis may prove to be of particular value in bringing to
light a systematic set of assumptions or propositions about the probability
relationships between manifest data and latent structures. By exposing these
assumptions or propositions about the measurement process, it may lead to a
more thorough understanding of the process itself. As Lazarsfeld expresses it:


The purpose of latent structure analysis is to provide mathematical models 
by which the various uses of itemized tests can be related to each other. The 


[...] Measurement and Prediction (Princeton, N.J.: Princeton University
Press, 1950), p. 19.

[...] "A Conceptual Introduction to Latent Structure Analysis," in [...]
Thinking in the Social Sciences (New York: The




tantism would usually include all the Protestant denominations, whereas a
nominal scale for political affiliations within the United States would generally
provide categories for the minor parties as well as the major ones. If the
minor parties were lumped together under a single category, individuals within
the domain of political affiliation would still be [...]. Like other scales,
nominal scales are the result of a measurement process, and it is well to view
them in this context. The categories or classes are, to some degree,
abstract, and their use in research depends on the development of
measures or operational definitions of them. The categories, and the properties
or variables with which they are involved, vary in scope. For example,
the numerous Protestant denominations, or the different political affiliations
within the United States, have considerably less scope than the categories with
respect to sex. The difference in scope here pertains not only to the number
of individuals each set of categories would include but also to the time
dimension. Presumably, human beings have been differentiated into males and
females since their origin and will continue to be so differentiated, whereas
the denominations within Protestantism and the political sector of the United
States have changed in the past and will probably continue to change.
It is also important to evaluate nominal scales with respect to systematic
import. In the field of biology, nominal scales with respect to sex have a great
deal of systematic import because there are many variables or characteristics
associated with or related to sex. Each sex has many unique attributes and
characteristics, and these relationships help to provide theories, explanations,
and predictions for a wide variety of biological phenomena.

Although nominal scales may seem quite simple and straightforward,
nevertheless difficulties are involved in such classifications. This is obvious
if, first of all, we bear in mind that such scales are results of a measurement
process that has all the usual difficulties with respect to the establishment of
reliability, validity, scope, and systematic import. In addition, however, there
are particular problems associated with the attempt to impart the characteristic
of uniqueness possessed by numbers to classes or categories of a given property
or variable. Let us take, as an example, a classification of societies into
sacred and secular ones, with the categories referring to differences in
traditionalism. As in many nominal-scale classifications, there is the problem of
what to do with the borderline cases. If these cases are assigned to one or the
other of the two categories, there tends to be a certain degree of arbitrariness,
for such cases might just as well be assigned to one category as to the other.
The same kind of problem might arise in an interview situation in which a
respondent is asked to choose the major political party he most favors. A given
individual may equally favor both the Democratic and the Republican parties
and he may resent being forced to make a choice he feels is not an accurate
representation of his true orientation.





Ordinal Scales 


Sec. 11.3]

The problem here seems to be that the nominal scale is not sensitive enough 
to take into account the various gradations within a given category, and the 
development of scales with more of the characteristics of numbers would 
seem to be required. Thus, for example, sociologists generally have conceived 
of a continuum with the sacred society at one pole and the secular society at
the other. This can also be done with respect to political orientation, with an 
extremely favorable view of the Democratic party at one end of the continuum 
and an extremely favorable view of the Republican party at the other. Such 
refinements do not guarantee that the resulting data will have systematic im- 
port, but they do increase the precision of the measurement and thus provide 
the potential for doing a great deal more with it. 

11.3 ORDINAL SCALES 
Axioms for Simple Order 

Whereas the characteristic of numbers imparted by nominal scales to properties
or variables of objects is uniqueness, ordinal scales have to do with the
order relationships manifested by numbers. Thus, 1 is less than 2, 2 is less
than 3, and so on, with the numbers indicating rank in this context. If
individuals were lined up in such a way that their height increases from the front
to the back of the line, then their height (the property in question) might be
ranked by assigning a 1 to the first in line, a 2 to the second, and so on. These
numbers should not be assumed to imply anything more than a rank order. 
For example, the difference in height between Individual 2 and Individual 1 
may be much greater than that between Individual 3 and Individual 2. In 
addition, Individual 4 is certainly not four times as tall as Individual 1. 
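
The point about ranks can be made concrete with a small sketch. The names and heights below are hypothetical values chosen only for illustration; the code assigns ranks by height and then shows that equal steps in rank need not reflect equal steps in height.

```python
# Hypothetical heights in centimetres; names and values are illustrative only.
heights = {"P": 150.0, "Q": 175.0, "R": 176.0, "S": 190.0}

# Line the individuals up from shortest to tallest and assign ranks 1, 2, 3, ...
ordered = sorted(heights, key=heights.get)
ranks = {name: position for position, name in enumerate(ordered, start=1)}

# Equal steps in rank do not correspond to equal steps in height.
gap_2_vs_1 = heights[ordered[1]] - heights[ordered[0]]
gap_3_vs_2 = heights[ordered[2]] - heights[ordered[1]]

# Nor does rank 4 mean "four times as tall" as rank 1.
ratio_4_to_1 = heights[ordered[3]] / heights[ordered[0]]

print(ranks)
print(gap_2_vs_1, gap_3_vs_2)
print(round(ratio_4_to_1, 2))
```

Only the order of the ranks carries information here; the sizes of the gaps and the ratio are properties of the heights, not of the ranks.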

The assignment of ranks in terms of height is justified by the statement that
the individuals are lined up in such a way that their height increases from the
front to the back of the line. In actual research, however, no one tells the
investigator that real phenomena are rank ordered and that he is therefore
justified in assigning ranks to objects in respect to a given property. The
property may not, in fact, be a unitary one and thus it may not be possible to
rank the objects so that they have the same characteristics as ordered numbers.
The investigator is thus presented with two tasks: to order objects with
respect to a given property or variable through the assignment of ranks, and
to test (insofar as possible) whether the actual phenomena possess the order
characteristics of numbers. The same two types of tasks are involved in the
development of nominal scales. One can proceed to give names to a number

Axioms for simply ordered and partially ordered collections will be taken up in this
section. One additional type of ordering applied by behavioral scientists is the ordered
metric scale. See C. H. Coombs, "Theory and Methods of Social Measurement," in L.
Festinger and D. Katz (eds.), Research Methods in the Behavioral Sciences (New York:
Holt, Rinehart and Winston, 1953), pp. 477-81.




of categories, but it is quite another thing to determine whether objects [...]
the property in question actually do [...] another. If there are borderline
cases that fall [...], or cases that fit equally well into more than one category,
the phenomena involved do not have the nominal characteristics
the investigator hopes to impart to them.

In order to impart various characteristics of numbers to a given property of
phenomena, it is necessary that the axioms of number hold for the
property in question. Scaling procedures vary in the degree of attention given
to testing this point. To understand this more fully, let us examine the axioms
on which the ordinal characteristics of numbers are based:


Let C be any collection whose elements we call points and denote by small
letters x, y, z, a, b, . . . , and let there be given a "binary relation"
between the points of C which we denote by <. Then C is called simply
ordered, or linearly ordered, with respect to < if the following axioms hold
(cf. E. Huntington, The Continuum and Other Types of Serial Order
[Cambridge, Mass.: Harvard University Press, ...]):

1. If x and y are distinct points of C, then
x < y and/or y < x.

2. If x < y, then x and y are distinct.

3. If x < y and y < z, then x < z.

It should be noted that axioms are stated in very general terms, and this is
the source of their great scope. We are then free to attempt to compare
various types of phenomena with them, i.e., to test whether or not the axioms
hold true when they are applied within a particular context. For example, C
may be any collection of numbers, of individuals with different heights, or of
individuals who attach different degrees of importance to a given goal or
value.
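
For a finite collection, the three axioms can be checked mechanically. The following is a minimal sketch, not a procedure taken from the text: the function name `is_simply_ordered` and the example collections are assumptions introduced for illustration, and the collection is assumed to contain no repeated elements.

```python
from itertools import permutations


def is_simply_ordered(points, precedes):
    """Return True if `precedes` satisfies Axioms 1-3 on the finite collection `points`."""
    points = list(points)
    # Axiom 1: for distinct x and y, x < y and/or y < x.
    for x, y in permutations(points, 2):
        if not (precedes(x, y) or precedes(y, x)):
            return False
    # Axiom 2: if x < y, then x and y are distinct (so x < x never holds).
    for x in points:
        if precedes(x, x):
            return False
    # Axiom 3: if x < y and y < z, then x < z.
    for x in points:
        for y in points:
            for z in points:
                if precedes(x, y) and precedes(y, z) and not precedes(x, z):
                    return False
    return True


# A collection of numbers under "less than" satisfies all three axioms.
numbers_ok = is_simply_ordered([3, 1, 2], lambda a, b: a < b)

# Hypothetical heights with a tie: Axiom 1 fails for the two tied individuals.
heights = {"P": 150, "Q": 175, "R": 175}
heights_ok = is_simply_ordered(heights, lambda a, b: heights[a] < heights[b])

# A cyclic relation satisfies Axioms 1 and 2 but violates Axiom 3.
beats = {("rock", "scissors"), ("scissors", "paper"), ("paper", "rock")}
cycle_ok = is_simply_ordered(
    ["rock", "paper", "scissors"], lambda a, b: (b, a) in beats
)

print(numbers_ok, heights_ok, cycle_ok)
```

The same function can be reused with any interpretation of the relation, which is the sense in which the axioms have great scope.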


Each of the four types of scales (nominal and ordinal scales are discussed
in this chapter; interval and ratio scales will be taken up in a later chapter)
attempts to impart to a given property the characteristics of numbers imparted
by the previous scale. Additional characteristics of numbers, however, are
involved as well. Thus, the scales form a cumulative hierarchy in terms of the
characteristics which they impart. In ordinal scales, the axioms become trivial
statements unless there are several "distinct points of C," for the fundamental
relationship precedes (<) is defined in terms of the concept distinct
points. This concept is quite similar to that of the uniqueness of categories,
which was necessary for the development of nominal scales. Thus, ordinal
scales are nominal scales as well, for they have distinct or unique points.

Various theorems can be proved by using the axioms as well as the definitions.
Thus, e.g., we can prove the theorem that "If x and y are distinct points
of C, then not both x < y and y < x hold" by utilizing Axioms 1, 2, and 3.
From Axiom 1 we know that there are three possibilities: x < y, y < x, and
both x < y and y < x. Axiom 3, however, rules out the third possibility, for
if it were true, it would result in a contradiction. To prove this, let us assume
that both x < y and y < x. According to Axiom 3, if x < y and y < z, then
x < z. Thus, it should hold that if x < y and y < x (substituting x for z),
then x < x. But this is a contradictory statement because, according to Axiom
2, if x < y then x and y are distinct. But if we substitute x for y within this
axiom, then x < x implies that x and x are distinct, which is contradictory.
Thus both x < y and y < x cannot be true.

This, of course, is not the only theorem that can be proved on the basis of
these axioms. One important theorem has to do with the possibility of
establishing a simple order among any given large number of points so that, for
example, x1 < x2, x2 < x3, x3 < x4, . . . . What is involved here is an
extension of the "transitivity" idea involved in Axiom 3 (if x < y and y < z, then
x < z) to any larger number of points. In attempting to determine whether
these axioms actually do hold true for certain phenomena, it is very often this
particular theorem that is tested. If the theorem does not hold, then (by
implication) one or more of the axioms from which it was derived do not
hold. Thus, for example, if we interpret point as basketball team and the
binary relation precedes (<) as invariably loses to, we may try to establish
whether the theorem holds for any given ordering of C (ranking of the set of
basketball teams). If the teams are ranked so that x1 < x2, x2 < x3, x3 < x4,
x4 < x5, . . . , then we might examine the data on actual wins and losses to see
whether or not this theorem actually holds. Undoubtedly we would find many
instances in which the theorem does not hold, e.g., if Team x1 does not lose
to Team x2 even on one occasion, or if Team x1 does not lose to every other
team in a perfectly consistent fashion.
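
A test of this kind can be sketched in a few lines. The game records below are invented for illustration, and the helper names `invariably_loses_to` and `violations` are hypothetical; the sketch simply lists every pair in a proposed ranking for which the lower-ranked team does not lose every recorded game to the higher-ranked one.

```python
# Hypothetical game results as (loser, winner) pairs; these are not real data.
games = [
    ("x1", "x2"), ("x1", "x3"), ("x2", "x3"),
    ("x1", "x2"), ("x3", "x2"),
]
ranking = ["x1", "x2", "x3"]  # proposed order: x1 < x2 < x3


def invariably_loses_to(a, b, games):
    """True if a and b met at least once and a lost every such game."""
    met = [(loser, winner) for loser, winner in games if {loser, winner} == {a, b}]
    return bool(met) and all(loser == a for loser, _ in met)


def violations(ranking, games):
    """Pairs (a, b) with a ranked below b for which 'a invariably loses to b' fails."""
    failed = []
    for i, a in enumerate(ranking):
        for b in ranking[i + 1:]:
            if not invariably_loses_to(a, b, games):
                failed.append((a, b))
    return failed


print(violations(ranking, games))
```

In these invented records x2 and x3 each beat the other once, so the pair (x2, x3) is reported as a violation of the strict interpretation.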

Some Options 

There are several possible ways out of this dilemma. They seem to boil
down to two general approaches: one of them involves dealing only with
phenomena that satisfy a stringent set of conditions; the other involves relaxing
the stringency of the conditions so that it is possible to deal with a wide
range of data. The first approach is cognate to extreme operationism, in that
both invoke some particular methodological goal and work toward its
achievement as an end in itself rather than as a means to the achievement
of theory, explanation, and prediction. This stringent approach to
scaling breaks down, as did extreme operationism, because it leads to an
inability to apply any scale (even a nominal one) to phenomena with certainty
that the axioms will hold. Fundamentally, we do not have the ability to
validate any proposition about the real world with certainty.



In the broad strategy of a less stringent approach there are several
different orientations. One of them is to be conservative about the type of
scale one tries to develop for given phenomena. For example, because the
axioms on which nominal and ordinal scales are based are less stringent
than those for interval or ratio scales, researchers with this orientation tend
to be involved in the development of the former rather than the latter. By so
doing they can be more confident that the scale they produce will
satisfy the relevant axioms. Another, far less conservative approach focuses
on the development of interval and ratio scales in spite of the difficulties
involved in the validation of a more stringent set of axioms. Thus the
conservative approach generally results in a greater number of
tested assumptions about the nature of reality than the alternative
approach. In a third approach, the researcher simply makes an interpretation
of terms, such as precedes, more in accord with actual data. Thus,
rather than interpreting precedes as invariably loses to, we might interpret it as
loses more than half of the time to. In this way, it is easier to find
phenomena that satisfy the axioms. A fourth approach is illustrated by the
orientation of Lazarsfeld with respect to latent structure analysis. Rather than
interpreting the terms of the axiom system so as to be able to satisfy the
axioms more easily, one can make a point, for example, of investigating why
it is that more exacting interpretations do not hold. Thus, Lazarsfeld's approach
might be applied to an analysis of a "latent" classification of teams with
respect to ability, and theories incorporating probability laws might be developed
to explain and predict the relationship between these latent classes
and the manifest data on wins and losses.


Summated Ratings 

Many specific procedures for the construction of ordinal scales have been
developed, and among these is the summated scale. Let us suppose, for example,
that having unearthed the factor of individualism as one of the important
values affecting behavior, we desire to rank individuals with respect to the
importance of this particular value for each. One simple procedure is to give
a score of 1 for each "agree" answer and a score of zero for each "disagree"
answer, as illustrated below:


1. [...] superior a man must stand alone.

2. [...] man who stands alone who excites our admir[...]

3. [...] internal world of ideals, of sensitive feelings, [...] of self-knowledge, is man's true home.

4. [...] would be a man must be a nonconformist.

5. [...] individualist is the man who is most likely to [...] the best road to a new future.




In developing an overall score for a given individual, a logical procedure 
would be to sum the scores of that individual on each of the five items. Thus, 
an individual who agreed with all the items would have a score of 5 whereas
someone who disagreed with all would have a score of 0. This procedure of 
summing scores over a number of different items gives rise to the term sum- 
mated scales. It provides us with a means for ordering individuals in relation 
to one another. Ties will occur quite frequently in this example, however, for 
the only possible scores are 0, 1, 2, 3, 4, and 5, and there may be many more 
individuals than these six possible scores. 
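
The summation just described can be sketched as follows. The respondents and their answers are hypothetical; with seven respondents and only six possible totals, at least one tie is unavoidable.

```python
from collections import Counter

# Hypothetical responses: 1 = "agree", 0 = "disagree" on each of five items.
responses = {
    "r1": [1, 1, 1, 1, 1],
    "r2": [1, 0, 1, 0, 0],
    "r3": [0, 1, 0, 1, 0],
    "r4": [0, 0, 0, 0, 0],
    "r5": [1, 1, 0, 1, 0],
    "r6": [0, 0, 1, 1, 1],
    "r7": [1, 0, 0, 0, 1],
}

# The summated score is simply the number of "agree" answers.
totals = {name: sum(answers) for name, answers in responses.items()}

# Scores shared by more than one respondent are ties in the resulting order.
ties = {score: count for score, count in Counter(totals.values()).items() if count > 1}

print(totals)
print(ties)
```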

This is a variant of simple order, namely, a partial ordering of individuals.
The definitions and axioms relating to a partially ordered collection may be
stated as follows:

If C is any collection and ≤ a binary relation between elements of C, then
C is called partially ordered with respect to ≤ if the following axioms hold:

1. For every element x of C, x ≤ x.

2. If x ≤ y and y ≤ x, then x and y denote the same element of C.

3. If x ≤ y and y ≤ z, then x ≤ z.

These axioms thus allow for large numbers of ties, for the binary relation
≤ may be interpreted to mean precedes or is tied with. In a simple
ordering, by contrast, no ties may occur, for the situation would be the
following: For any two distinct points, one must precede the other. In the example
cited, individuals with the same total score are tied with one another, while
simple-order relationships are designated among individuals whose total scores
differ.

As with any scale, one important question is not whether the numbers that 
are assigned to the different objects or individuals satisfy the axioms on which 
the scale is based, but whether the underlying property that is scaled does so.
This distinction is quite easy to overlook, for most individuals automatically 
tend to assign the characteristics of numbers to any items that are numbered. 
But how much is one quart, two pears, four planes, and seven ships? An 
answer of fourteen has little meaning because it does not specify any specific 
property being totaled. In the same way, let us assume that in developing a 
given summated scale, we happen to utilize several items that have nothing to 
do with the property measured by the remaining items but, rather, some other 
property. This may easily occur if the concept being measured is fairly abstract, 
and if no prior analysis (for example, factor analysis) was utilized to discover 
those items that are interrelated. As a matter of fact, most summated scales 
developed by social scientists are ad hoc in the sense that the items involved 
have not been subjected to any prior analysis. There is no way of determining, 
on the basis of this summated rating procedure, whether items measuring different
properties are incorporated into a given summated scale. In any case,
the summed scores for the different individuals would form a partial order, but
this ordering merely shows that numbers have the characteristics of partial
order; this was known before the investigation began. It is only when measurement
of the designated property has been achieved that something new is learned.
Thus, an important limitation of this summated rating procedure is that it does
not in itself provide evidence that the property under investigation is a unitary
one and that the axioms for partial order hold for this property as it is
scaled by this procedure.

Ibid., p. 49.

Another way of looking at this deficiency is to examine the different ways
in which any given score may be obtained. A score of 2, for example, may be
produced in any of 10 different ways: "agree" on Items 1 and 2, and "disagree"
on Items 3, 4, and 5; "agree" on Items 1 and 3, and "disagree" on Items 2,
4, and 5; and so on. If as many as 20 items were used, there would be a
total of 190 different ways of obtaining a score of 2. Thus, individuals who
obtain a given total score in very different ways are treated as having the same
ranking on the property in question. They may in fact be [...]. It would
seem to be quite a simple matter for the questions to vary from one
another so that different combinations of them selected by individuals imply
differences among these individuals on the property in question.
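
The counts of 10 and 190 are the numbers of ways of choosing which two items receive an "agree" answer. A short sketch, using only the Python standard library, enumerates the five-item case and computes the twenty-item case:

```python
from itertools import combinations
from math import comb

items = [1, 2, 3, 4, 5]

# Each way of scoring 2 corresponds to a choice of the two items agreed with.
ways_to_score_two = list(combinations(items, 2))

print(ways_to_score_two)
print(len(ways_to_score_two))  # number of ways with five items
print(comb(20, 2))             # number of ways with twenty items
```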


Likert Procedures 

A more elaborate technique for obtaining summated ratings has been developed
by Likert. It may be outlined as follows:

1. It includes five categories for responses: "strongly agree," "agree,"
"undecided," "disagree," "strongly disagree."

2. Statements are classified into those which are favorable and those which
are unfavorable, and approximately the same number of statements of
each type is utilized.

3. For the favorable statements, the weights given to "strongly agree,"
"agree," "undecided," "disagree," and "strongly disagree" are 4, 3, 2,
1, and 0, respectively. For unfavorable statements, the weights are 0, 1,
2, 3, and 4, respectively; thus agreement with favorable statements
and disagreement with unfavorable statements are treated as equivalent.

4. A large number of statements are administered to a group of respondents
who are representative of those for whom the questionnaire is
being constructed.

“■ "I”""" '"■'..M™/ IN,w Yorl: H»l>" 




5. The responses are analyzed to determine which items discriminate best
between the high-scoring individuals and the low-scoring individuals.
Thus, for example, items to which high scorers and low scorers respond
similarly are eliminated. This is known as an item-analysis procedure.
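
The weighting in step 3 and the item analysis in step 5 can be sketched as follows. This is an illustration under stated assumptions rather than Likert's own computation: the function names are hypothetical, the scored responses are invented, and the item analysis compares the top and bottom halves of respondents by total score, which is only one of several possible ways of defining high and low scorers.

```python
FAVORABLE_WEIGHTS = {
    "strongly agree": 4, "agree": 3, "undecided": 2,
    "disagree": 1, "strongly disagree": 0,
}
# Unfavorable statements are weighted in reverse: 0, 1, 2, 3, 4.
UNFAVORABLE_WEIGHTS = {label: 4 - w for label, w in FAVORABLE_WEIGHTS.items()}


def item_score(response, favorable):
    """Weight of one response, depending on whether the statement is favorable."""
    weights = FAVORABLE_WEIGHTS if favorable else UNFAVORABLE_WEIGHTS
    return weights[response]


def total_score(responses, favorable_flags):
    """Summated score of one respondent over all statements."""
    return sum(item_score(r, f) for r, f in zip(responses, favorable_flags))


def discrimination(item_scores, totals):
    """Mean item score of the higher-scoring half minus that of the lower half
    (assumed criterion; respondents are split by total score)."""
    order = sorted(range(len(totals)), key=lambda i: totals[i])
    half = len(order) // 2
    low, high = order[:half], order[len(order) - half:]
    mean_low = sum(item_scores[i] for i in low) / half
    mean_high = sum(item_scores[i] for i in high) / half
    return mean_high - mean_low


# Hypothetical already-weighted scores: rows are respondents, columns are items.
scored = [[4, 4, 2], [3, 4, 2], [1, 0, 2], [0, 1, 2]]
totals = [sum(row) for row in scored]
by_item = [discrimination([row[k] for row in scored], totals) for k in range(3)]

print(totals)
print(by_item)
```

In this invented data the third item receives the same answer from everyone, so its discrimination value is zero and it would be a candidate for elimination.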


The Likert procedure for obtaining summated ratings provides some indirect
evidence as to the existence of a partial order in the property itself. The
item-analysis procedure increases the degree of homogeneity or internal consistency
in the set of items. Although this provides no guarantee that only one property
is being measured by the set of items, it seems likely that it does serve to
eliminate many of those items that provide measures of different properties.
It is still quite possible that the final set of items involves a number of different
properties. Thus, for example, a given item may be accepted after an item
analysis, but it may be either a cause or an effect of the property being measured
and may not constitute a measure of the property.

Cumulative Scales 

Cumulative scaling procedures focus directly on testing whether or not the
axioms for simple order hold for the property under investigation. To the
extent that such evidence can be obtained, the investigator can be more
confident that he is dealing with a single or unitary property versus a mixture
of properties. Let us return to the basketball example and assume that there
are five teams: A, B, C, D, and E. If we now interpret < as invariably loses to,
then we can specify which teams must lose to which other teams in order for
a given simple order among the teams to hold. We may illustrate this by
setting up a matrix in which the five teams appear as both rows and columns
and in which any cell represents the outcome of basketball games between
the "row" team and the "column" team. Let us further specify that a plus
appearing in a given cell indicates that the "column" team invariably loses
to the "row" team, while a minus indicates that the "row" team invariably
loses to the "column" team. Then, if one example of a simple order actually
exists with respect to the property of losing or winning basketball games, the
following outcomes will occur:


          A    B    C    D    E

     A         +    +    +    +
     B    -         +    +    +
     C    -    -         +    +
     D    -    -    -         +
     E    -    -    -    -




The series of +'s in the first row indicates that Teams B, C, D, and E
invariably lose to Team A. In the second row we see that Teams C, D, and E
invariably lose to Team B and that Team B invariably loses to Team A (the
- score). This latter piece of information is not a new statement, for the
information in the first row indicated the same thing. In fact, all of the minus
(-) entries repeat the information transmitted by corresponding plus (+)
entries. In the third row, we find that Team C invariably loses to Teams A
and B, while Teams D and E invariably lose to Team C, and so on. No entries
appear when a team is paired against itself.

We may now examine these relationships to see whether or not the three
axioms that lead to a simply ordered collection are satisfied. Axiom 1,
as interpreted in this example, specifies that for any pair of teams, say x
and y, x must invariably lose to y, or y must invariably lose to x, or
both. The fact that the matrix is filled up with pluses and minuses indicates
that an invariable loss relationship holds for any pair of teams (except for
the blank cells, which do not represent instances in which two teams are
involved).

Axiom 2 specifies that if x invariably loses to y, then x and y are different
teams. In looking at the matrix, we see that all of the "invariable loss
relationships" involve different teams. As for the blank cells, no invariable loss
relationships are involved here, so it does not matter that the teams are not
separate or distinct.

Axiom 3 specifies a principle of transitivity: if x invariably loses to y and y
invariably loses to z, then x invariably loses to z. Starting with the
relationships among Teams A, B, and C, we note that C invariably loses to B and B
invariably loses to A, and we also note that C invariably loses to A. Examining
the relationships among B, D, and E, we find that E invariably loses to D,
D invariably loses to B, and E invariably loses to B. Indeed, for each of the
ten possible sets of three teams (ABC, ABD, ABE, ACD, ACE, ADE, BCD,
BCE, BDE, CDE) this transitivity relationship holds.
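
The matrix and the check of the three axioms can be reproduced in a short sketch. The construction below assumes the simple order described in the text (A strongest, E weakest); the helper names `cell` and `loses_to` are introduced here for illustration only.

```python
from itertools import combinations

teams = ["A", "B", "C", "D", "E"]
position = {team: i for i, team in enumerate(teams)}  # A first, E last


def cell(row, col):
    """'+' if the column team invariably loses to the row team, '-' if the
    row team invariably loses to the column team, blank on the diagonal."""
    if row == col:
        return ""
    return "+" if position[row] < position[col] else "-"


matrix = {row: {col: cell(row, col) for col in teams} for row in teams}


def loses_to(x, y):
    """x < y: x invariably loses to y, i.e. a '+' in row y, column x."""
    return matrix[y][x] == "+"


# Axiom 1: for every pair of distinct teams, one invariably loses to the other.
axiom1 = all(loses_to(x, y) or loses_to(y, x) for x, y in combinations(teams, 2))

# Axiom 2: no team invariably loses to itself (the diagonal is blank).
axiom2 = not any(loses_to(t, t) for t in teams)

# Axiom 3: transitivity, which covers each of the ten sets of three teams.
triples = list(combinations(teams, 3))
axiom3 = all(
    loses_to(x, z)
    for x in teams for y in teams for z in teams
    if loses_to(x, y) and loses_to(y, z)
)

for row in teams:
    print(row, [matrix[row][col] or " " for col in teams])
print(len(triples), axiom1, axiom2, axiom3)
```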

Because we can show that the three axioms for a simply ordered collection
hold with respect to the property of outcomes of games among Teams A, B,
C, D, and E, we are able to rank these teams 1, 2, 3, 4, and 5, respectively,
with respect to this property. Cumulative scaling provides the kind of data
that permit the investigator to test each of the axioms that enable us to
designate a given collection as a simply ordered one. In this respect it differs from
summated ratings, in which, at best, the evidence for the truth of the axioms
is extremely partial and indirect. It should be noted that the interpretation of
the binary relation < need not be as stringent as the one adopted for
illustrative purposes, i.e., invariably loses to. If such an interpretation were
adopted, we would rarely be able to find a set of basketball teams for which
the axioms would be true. An interpretation of <, which would permit us
to find such basketball teams more easily, might be loses more than half of
the time to.

Scale Analysis 

The major technique used to develop cumulative scales on the basis of
items in questionnaires or interview schedules is called scalogram analysis or
scale analysis, a procedure developed by Louis Guttman. The kind of
pattern that should be illustrated by items in a questionnaire in order that we
may be able to rank them without violating any of the axioms for simple
order is the same as the illustration provided for basketball teams. In the
present example, we are dealing with a collection of items in a questionnaire,
and the property involved is fear symptoms. We shall interpret the binary
relation < to mean, in the context Item 1 < Item 2, that if any individual
responds positively (e.g., yes versus no) to Item 1, he invariably responds
positively to Item 2.

In the following matrix, positive responses are indicated by + and negative
responses are indicated by -. The rows represent groupings of respondents,
each of which has a different pattern of responses. The columns represent
the different items being tested to determine whether or not a simple
order exists among them. There are nine items, and each
grouping of respondents follows a pattern of pluses and minuses depending
on the response given to each item. The items [...]
their physical reactions to the dangers of [...]
under fire. The following items were tested as to the existence of a simple
order:


1. Urination in pants. 

2. Losing control of the bowels.

3. Vomiting. 

4. Feeling of weakness or faintness. 

5. Feeling of stiffness. 

6. Feeling sick at the stomach. 

7. Shaking or trembling all over. 

8. Sinking feeling of the stomach. 

9. Violent pounding of the heart.** 


Stouffer et al., op. cit., [...]

[...] Matilda White Riley, John W. Riley, Jr., and Jackson Toby,
[...] (New Brunswick, N.J.: [...]

This scale of fear symptoms is not just a [...]; it was actually
found to form a scale according to the various criteria [...]. Originally [...]
items were tested and one of them ("cold sweat") did not fit into the scale.
See Stouffer et al., op. cit., pp. 13-14, 140-42.




We may now specify that the following pattern of responses would indicate
that these items are simply ordered:


Groupings of                     Item Number
Respondents      1    2    3    4    5    6    7    8    9

     1           +    +    +    +    +    +    +    +    +
     2           -    +    +    +    +    +    +    +    +
     3           -    -    +    +    +    +    +    +    +
     4           -    -    -    +    +    +    +    +    +
     5           -    -    -    -    +    +    +    +    +
     6           -    -    -    -    -    +    +    +    +
     7           -    -    -    -    -    -    +    +    +
     8           -    -    -    -    -    -    -    +    +
     9           -    -    -    -    -    -    -    -    +
    10           -    -    -    -    -    -    -    -    -



We may examine these patterns of responses to determine whether or not
the three axioms which lead to a simply ordered collection are satisfied.
Axiom 1 specifies that, for any pair of items, say x and y, either x is +
and y is +, or x is - and y is +, or x is - and y is -, or x is + and y is -.
Thus, any combination of + and - for any pair of items satisfies Axiom 1,
and all that is required is that every cell of the matrix has either a + or a -.
Axiom 2 simply requires that if, for example, Item 1 < Item 2, then Items 1
and 2 are distinct. Because all the items are distinct in this example, this axiom
is satisfied.

To test the transitivity axiom, we may start by examining Items 1, 2, and
3. We may note that Item 1 < Item 2, because (1) for the first grouping of
respondents, individuals who rate Item 1 positively also rate Item 2 positively;
(2) one cannot maintain that Item 2 < Item 1, for the second grouping of
respondents rates Item 2 positively and Item 1 negatively. We may also note
that Item 2 < Item 3, because (1) the first two groupings of respondents rate
both Items 2 and 3 positively, and (2) the third grouping rates Item 3
positively and Item 2 negatively, indicating that it is not true that Item 3 < Item 2.
We also find, in accordance with Axiom 3, that Item 1 < Item 3, because
(1) the first grouping of respondents rates both Items 1 and 3 positively; and
(2) it is not true that Item 3 < Item 1, for the second and third groupings
of respondents rate Item 3 positively and Item 1 negatively. Thus the
transitivity axiom holds for Items 1, 2, and 3. There are a total of eighty-four
different combinations of three items, and it may be observed that the
transitivity axiom holds for each of these.
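
The check over all eighty-four combinations can be sketched as follows. The perfect response patterns are generated from the description in the text (the first grouping is positive on every item and each later grouping drops one more item); the function names are assumptions introduced for illustration.

```python
from itertools import combinations, permutations

N_ITEMS = 9

# Grouping g (1 through 10) responds negatively to Items 1 .. g-1 and
# positively to the remaining items; True stands for a "+" response.
patterns = {
    g: [item >= g for item in range(1, N_ITEMS + 1)]
    for g in range(1, N_ITEMS + 2)
}


def precedes(i, j):
    """Item i < Item j: every grouping positive on Item i is positive on Item j,
    and the converse does not hold."""
    implies = all(p[j - 1] for p in patterns.values() if p[i - 1])
    converse = all(p[i - 1] for p in patterns.values() if p[j - 1])
    return implies and not converse


def transitive_on(triple):
    """True if Axiom 3 holds for every ordering of the three items."""
    return all(
        precedes(x, z)
        for x, y, z in permutations(triple)
        if precedes(x, y) and precedes(y, z)
    )


triples = list(combinations(range(1, N_ITEMS + 1), 3))
all_transitive = all(transitive_on(t) for t in triples)

print(len(triples), all_transitive)
```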

The response patterns just discussed are those that would indicate perfect
consistency with the axioms for simple order. Even so, one would not be able 





to obtain a simple order for the respondents, assuming that more than ten
respondents were involved, since there would be one or more ties. However,
a partial order would be obtainable. Thus, the groupings of respondents
numbered from 1 through 10 may be used as a partial ordering for the
respondents. Individuals within a given grouping are undifferentiated with respect to
response pattern, but individuals from different groupings are in an ordinal
relationship with respect to one another.

The chief contribution of scale analysis seems to be that it focuses attention
on a direct test of the axioms for simple order. This is important evidence
that the concept is a unitary one. Most scales that have been developed are not
perfect, although there are some examples of perfect scales. Thus, for
example, it is usually the case that a certain proportion of individuals within
each grouping of respondents departs from what may be called the perfect
scale type for that grouping. Within scale analysis these departures are treated
as errors, and a relatively arbitrary criterion is utilized to determine whether
the proportion of errors among the total responses is too great to maintain
that the property is scalable. Thus, with this scaling technique it is possible to
reject the hypothesis that a series of items can yield an ordinal scale of a
given property. In Guttman's terminology, this would indicate that the
underlying property or concept is not unidimensional.

Although the Guttman approach does actually test the axioms for simple
order, the question may be raised as to whether even the achievement of
perfect scale patterns indicates conclusively that the underlying property is
unidimensional, i.e., that not more than one property or concept is being
measured. For example, the different items may be involved in cause-and-effect
relationships. In addition, one may actually have a very poor sample
of items with which to test the unidimensionality of the underlying concept.
When errors occur in a scale, these questions become even more serious.
Within the context of Stephenson's work, an attempt to obtain and scale
concepts that are unidimensional for a given group of respondents is less
fruitful than procedures wherein the investigator explores the frame of
reference or definition of the situation of a given individual. One criticism
sometimes made of scale analysis is that the results may vary greatly from one
group of respondents to another; a series of items that scales for one group
may not for another. Of course, this apparent deficiency may be turned into
an advantage if it serves to direct inquiries into the factors that produce
these differences.

Another important contribution provided by the technique of scale analysis
is that, to the extent that the proportion of errors is small, we are able to
reproduce the responses of a given individual on each of the items knowing
only which grouping of respondents (or scale type) he is in. If we know, for
example, that a given respondent is in the fifth grouping on the fear-symptoms
scale, we can then predict with fairly good accuracy that individual's response
to each of the nine items. This compares very favorably with the
summated-rating techniques, in which any given score may be derived from many
different combinations of items.

* See, for example, the scales of Hawaiian and Eskimo kinship terminology, with
coefficients of reproducibility of 1.00, developed by Ira R. Buchler, "A Formal
Account of the Hawaiian- and Eskimo-Type Kinship Terminologies," Southwestern
Journal of Anthropology, XX: 286-318 (1964).

The procedure for testing whether a given set of items is scalable has been
greatly simplified since the advent of punched-card equipment for data
processing. It may be illustrated with hypothetical data for three of the items
in the fear-symptoms scale: vomiting, feeling sick at the stomach, and violent
pounding of the heart. We may distinguish the following steps, assuming that
these data have already been punched into cards and that responses are
limited to two alternatives, yes (+) and no (-):


1. Obtain the distribution for the entire sample of + and - responses for
   each of the items and calculate the percentage of positive responses for
   each item.

2. Label the item with the smallest percentage of positive responses as 1,
   that with the next larger percentage as 2, and that with the largest as
   3. Let us assume that the distribution was as follows:

                                                              Percentage
                                             +      -   Total      +

   Item 1. Vomiting                         120    150    270     44
   Item 2. Feeling sick at the stomach      160    110    270     59
   Item 3. Violent pounding of the heart    180     90    270     67

3. Start with Item 1 and sort the cards into those with positive and those
   with negative responses.

4. Take those cards which were positive on Item 1 and sort them into those
   which are positive and those which are negative on Item 2. Do the same
   for those cards which were negative on Item 1, sorting them into the
   positives and negatives on Item 2. In this way one obtains four separate
   piles of cards: those positive on both Items 1 and 2 (+ +), those
   positive on Item 1 and negative on Item 2 (+ -), those negative on
   Item 1 and positive on Item 2 (- +), and those negative on both (- -).


K®’** *** '•“‘**“7 have more than two alterna. 

•» ?ppo5»^y“cr ... 



For this example, the number of errors is equal to the sum of the
frequencies of respondents in the various non-scale types. Thus, the sum
of 10, 10, 10, and 10 is equal to 40. Because there are three questions
and a total of 270 respondents, we have

   Coefficient of reproducibility = 1 - 40/(3 x 270) = 1 - .05 = .95.

The rule widely used for determining whether or not the coefficient of
reproducibility is high enough to satisfy the criterion of reproducibility
is to conclude that there is evidence for scalability if this coefficient is
equal to or greater than .90.* Thus, for the present example, we may
conclude that evidence has been obtained for the scalability of the
property being measured.

8. Guttman postulates other criteria for assessing the scalability of the
property or, in his terms, the universe of content. One of them has to
do with the number of categories into which answers to a given item
are divided. In the present example, each of the three items was
dichotomized (divided into two categories), but Guttman maintains that it
would be preferable to trichotomize some of the items if there are
fewer than ten. This would make it more difficult to achieve a coefficient
of reproducibility of .90 or greater on the basis of chance.

Another criterion has to do with the percentage of positive responses
for each item, which was calculated in Step 1. If perhaps half or more
of the items result in percentages of between 40 and 60, this criterion
is well satisfied. In the present example, where these percentages are
44, 59, and 67, this criterion is satisfied.

A third criterion has to do with the distribution of individuals among
the various non-scale types. If any one of these non-scale types has an
unusually large proportion of the total number of individuals with
non-scale patterns of response, we may question the existence of scalability.
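
The arithmetic of the coefficient of reproducibility may be sketched as follows. The pattern frequencies below are hypothetical: they were chosen only so that they agree with the marginal totals given in Step 2 (120, 160, and 180 positive responses among 270 respondents) and with four non-scale types of 10 respondents each; they are not taken from the original tabulation. Counting an error as the smallest number of responses that must be changed to reach a perfect scale type is likewise an assumption of this sketch.

```python
# Perfect scale types for three items ordered from least to most often endorsed.
# Each tuple is (Item 1, Item 2, Item 3); 1 = yes (+), 0 = no (-).
SCALE_TYPES = [(1, 1, 1), (0, 1, 1), (0, 0, 1), (0, 0, 0)]

# Hypothetical frequencies of each response pattern (an assumption, see lead-in).
FREQUENCIES = {
    (1, 1, 1): 90, (0, 1, 1): 50, (0, 0, 1): 30, (0, 0, 0): 60,  # scale types
    (1, 0, 1): 10, (1, 1, 0): 10, (1, 0, 0): 10, (0, 1, 0): 10,  # non-scale types
}


def errors_in(pattern):
    """Fewest responses that must change to turn the pattern into a scale type."""
    return min(
        sum(response != ideal for response, ideal in zip(pattern, scale_type))
        for scale_type in SCALE_TYPES
    )


def coefficient_of_reproducibility(frequencies):
    """1 - (total errors) / (number of items x number of respondents)."""
    respondents = sum(frequencies.values())
    items = len(SCALE_TYPES[0])
    errors = sum(errors_in(pattern) * n for pattern, n in frequencies.items())
    return 1 - errors / (items * respondents)


# Marginal totals of positive responses for Items 1, 2, and 3.
positives = [sum(n for p, n in FREQUENCIES.items() if p[i]) for i in range(3)]
print(positives)
print(round(coefficient_of_reproducibility(FREQUENCIES), 2))
```

With these frequencies the marginals reproduce the table in Step 2, the error count is 40, and the coefficient rounds to .95, in agreement with the calculation above.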

Finally, the number of items used to test for scalability is quite important.
Guttman originally maintained that the desirable number is ten or more. In
actual practice, very few scales with as many as eight items have been
developed; four-item and five-item scales are quite common. One approach
developed to deal with this smaller number of items so as to preserve intact
the various desirable properties of the Guttman scale is known as H-technique.*
It combines certain features of Latent Structure Analysis and Guttman scaling,
and it seems to have been utilized successfully in a variety of investigations.
Another approach is to utilize the statistical test suggested by Festinger* for
scales with few items, in order to determine the probability that a high
coefficient of reproducibility could have been obtained on the basis of chance
alone.

* A technique for utilizing a statistical test to take some of the arbitrariness
out of this procedure is discussed by Leon Festinger, "The Treatment of
Qualitative Data by 'Scale Analysis,'" Psychological Bulletin, XXXXIV: 149-61
(1947). This test also takes into account the number of items involved. See also
Guttman's reply to Festinger in L. Guttman, "On Festinger's Evaluation of Scale
Analysis," Psychological Bulletin, XXXXIV: 451-65 (1947).

* S. Stouffer, Social Research to Test Ideas (New York: The Free Press of
Glencoe, 1962, pp. 274-289).

Thus far, with respect to scale analysis, we have discussed the method of
determining whether a given set of items satisfies the various criteria for
scalability, but little has been said about the method for selecting these items
in the first place. Guttman's view on this subject may be inferred from the
following:

Scale analysis as such gives no judgment on content; it presumes that the
universe of content is already defined. . . . it might serve as an auxiliary
argument with respect to content in this special case where there is
controversy over but one or two items of a large sample of items in which the
remaining items are scalable . . . [but] there must be a cogent initial
argument based on content if an item is to be classified in an area. Sheer
scalability is not sufficient; an item may happen to scale with an area, and
yet not have the content defining the area -- it may be a correlative rather
than part of the definition.*


For Guttman, the primary task involved in the development of an initial
series of items is a conceptual one, and unless this is done any mechanical
procedure for locating items cannot be expected to do the job. Edwards and
Kilpatrick have developed what they call the scale-discrimination technique,*
which combines some of the techniques developed by Likert and Thurstone.
(See Chapter 10 for a discussion of Thurstone's method of equal-appearing
intervals.) The technique specifies that one start with a large number of items,
selecting only those about which one can obtain agreement among a large
number of judges.

A technique called multiple-scalogram analysis* has been developed which
aims at producing as many unidimensional scales as are consistent with the
underlying dimensions of the set of items, and at providing more quantitative
procedures for developing scales and testing for scalability. The procedure
is designed for use in conjunction with an electronic computer and is able to
handle large numbers of items with ease.


* L. Festinger, op. cit.

* S. Stouffer, et al., op. cit., p. 85.

* A. L. Edwards and F. P. Kilpatrick, "Scale Analysis and the Measurement of
Social Attitudes," Psychometrika, XIII: 99-114 (1948).

* J. C. Lingoes, Multiple-Scalogram Analysis: A Generalization of Guttman's Scale
Analysis. Unpublished doctoral dissertation, Michigan State University, 1960. See
also . . . Lingoes in Harold Borko (ed.), Computer Applications in the
Behavioral Sciences (Englewood Cliffs, N.J.: Prentice-Hall, . . .).


In this chapter we have discussed the two types of scales most widely used
by social scientists: nominal and ordinal scales. The two types of scales to
be discussed in Chapter 12 utilize more of the properties of numbers and thus
more easily lend themselves to the testing of highly systematic or
mathematically stated theory.

SELECTED REFERENCES FOR FURTHER STUDY 

Benjamin Fruchter, Introduction to Factor Analysis (Princeton, N.J.: D. Van
Nostrand Company, 1954). Unlike much of the literature on factor analysis,
this book provides an introduction to the subject that is understandable to the
student with very little background in mathematics or statistics. Included is a
brief treatment of the historical development of the technique, as well as those
topics in mathematics that are involved in factor analysis.

Robert McGinnis, Mathematical Foundations for Social Analysis (Indianapolis:
The Bobbs-Merrill Company, 1965). This book was written by a sociologist
and is designed to provide the student of behavioral science with the
mathematics background necessary for understanding recent applications of
mathematics to behavioral science. The general topics are variables and
relations, numbers, functions on numbers, and measures of change and stability.

Raymond L. Wilder, Introduction to the Foundations of Mathematics (New
York: John Wiley and Sons, 1952). The foundations of modern mathematics
are presented in a way which is intelligible to any student with the capacity for
abstract thinking. The topics include the axiomatic method, set theory, the real
number system, mathematical logic, intuitionism, formalism, and the cultural
setting of mathematics.





Interval and Ratio Scales 


12.1 INTERVAL SCALES 
Introduction 

The distinction between ordinal and interval scales is that the latter have a
constant unit. Consequently it is possible to perform the operations of
addition and subtraction on interval scales and come up with meaningful results.
If IQ's were characterized by a constant unit, the difference between an IQ
of 40 and 80 would be the same as that between 80 and 120. However, this
does not in fact seem to be the case. The latter difference lies within the
"normal" range, whereas the former distinguishes between an ability to cope
with the tasks presented to most individuals and a general inability to do so.
This illustrates the difficulties involved in the development of interval scales
even in areas where a great deal of research has been done.

Approaches to scaling vary in the degree of attention given to testing
whether the axioms of number theory hold for the phenomena under study.
By choosing to improve instruments for measurement before using them in
testing propositions, one might be able to gain by an increase in the probability
of correct conclusions. The losses might include the delay involved, and the
possibility that a less cautious approach might produce the kind of information
that could aid in the improvement of the scaling procedure.

The medical student study provides an example of a formulation which is
meaningful to the extent that interval scales of the key variables are available.
This is because (1) the difference is taken between value scores and scores
representing expectations for achieving the corresponding values, and (2) all
of these difference scores are then summed. Thus, we have the operations of
subtraction and addition involved in the calculation of expected value
deprivation. In that study the primary evidence advanced for the presence of
interval scales was indirect. The operations of subtraction and addition were
performed, and the meaningfulness of the resultant (EVD) was partially
tested by determining its relationship to specialty preferences. Since close


relationships were found, . . . for the meaningfulness of the mathematical . . .


Properties of Interval Scales

For a fuller understanding of interval scales, it is useful to examine their
mathematical properties. If we return to the discussion of ordinal scales, then
we let C represent any collection of elements we call points. One set of
properties which must be met is that represented by the axioms for simple
order. Interval scales meet other requirements as well, but they do constitute
ordinal scales. The positive integers in the real number system, for example,
constitute an interval scale, but they also possess ordinal properties, e.g.,
1, 2, 3, 4, . . . (1 precedes 2, 2 precedes 3, 3 precedes 4, . . .).

Interval scales also possess certain properties associated with the equivalence
relationship, which may be symbolized by =. These properties are possessed by
nominal scales, and interval scales constitute nominal as well as ordinal
scales. They were discussed informally in Section 11.2, and we now list these
defining properties or axioms of nominal scales, using mathematical notation:


1. If x is an element of C, then x = x (reflexiveness).

2. If x = y, then y = x (symmetry).

3. If x = y and y = z, then x = z (transitivity).*

These axioms have to do with the decomposition of the collection C into
non-overlapping classes. If, for example, classes do overlap, then the
equivalence relationship is not transitive (i.e., Property 3 does not hold). To
illustrate, let us categorize illness into the categories mental and physical,
and let us interpret the relationship = to mean "is in the same category of
illness as," and let us assume a situation where the categories of illness
overlap, as illustrated below.

Assume that individual x is in E (mental illness); y is in F (mental and
physical illness); and z is in G (physical illness). Then it is true that x = y
(the two are in the same category of mental illness), and it is also true that
y = z (they are both physically ill). But it is not true that x = z (x is
mentally ill but not physically ill, while z is physically ill but not mentally
ill). Of course,

* Raymond L. Wilder, Introduction to the Foundations of Mathematics (New York:
John Wiley and Sons, 1952, p. 47).




[Figure: overlapping categories of illness. E: mental illness; F: mental and
physical illness; G: physical illness; x, y, z: individuals, with x in E, y in
F, and z in G.]


if we have non-overlapping categories, such as male and female, the transitivity
property holds, as illustrated:

[Figure: non-overlapping categories. M: male; F: female; x, y, z: individuals.]

Thus, the fact that x and y are of the same sex, and y and z are of the same
sex, implies that x and z are of the same sex.
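
The contrast between overlapping and non-overlapping categories can be checked mechanically. The sketch below is only an illustration: individuals are represented by the set of categories they fall into, and "=" is read as "shares at least one category with." The helper names and the fourth individual w are hypothetical additions for the example.

```python
from itertools import product


def same_category(categories):
    """Build the relation 'a = b' meaning a and b share at least one category."""
    def related(a, b):
        return bool(categories[a] & categories[b])
    return related


def is_transitive(people, related):
    """True if a = b and b = c always imply a = c."""
    return all(
        related(a, c)
        for a, b, c in product(people, repeat=3)
        if related(a, b) and related(b, c)
    )


# Overlapping categories: y is both mentally and physically ill.
illness = {"x": {"mental"}, "y": {"mental", "physical"}, "z": {"physical"}}

# Non-overlapping categories: every individual falls into exactly one class.
sex = {"x": {"male"}, "y": {"male"}, "z": {"male"}, "w": {"female"}}

illness_transitive = is_transitive(illness, same_category(illness))
sex_transitive = is_transitive(sex, same_category(sex))
print(illness_transitive, sex_transitive)
```

For the illness categories the check fails, because x = y and y = z hold while x = z does not; for the categories of sex it succeeds.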

An additional set of properties for interval scales may be stated as follows:

1. If x = y, it is not true that x < y, and it is not true that y < x.

2. If x = y, then if x < z, it also must hold that y < z.

3. If x = y, then if it is not true that x < z, it is also not true that y < z.

These properties interrelate the properties of simple order with those of
equivalence, and they serve to further define the relationship of equivalence.
If we substitute appropriate positive integers for x, y, and z, it is easily seen
that positive integers possess the above properties.

One additional set of properties which serves to define the property of
addition is as follows:*


* For this listing of properties, as well as some of the . . . illustrative
purposes, see Virginia Senders, Measurement and Statistics (New York: Oxford
University Press, 1958, pp. 60-62).



. . . to one of two equivalent quantities, the result is that . . . when
equals are added to equals, the results are equal, is known as the axiom
of . . .

x + (y + z) = (x + y) + z. What is involved here is an associative
property. The order in which the operations of addition take place has
no effect on the results.

Properties 2 and 3 may be illustrated by . . . With respect to Axiom 2, let x
represent the playing ability of Player X, and let x' represent the ability of
Player X'. According to the . . ., the abilities of the two players are evenly
matched. Now we may introduce Player Y onto the scene. His playing ability y is
better than no ability whatsoever (he does manage to hit back the ball
occasionally), but it is much poorer than that of Player X or Player X'. Now,
according to Property 2, the ability of Player X' is inferior to the combined
abilities of Players X and Y. This, however, probably would not hold true in an
actual match, for Player Y would not add anything to Player X's game and might
even detract from it. We can see here that Property 2 is fundamental if we are
to be able to perform the operation of addition. Without this property, we
cannot treat the playing ability of tennis players on an interval scale.

To assess Property 3, let X, X', Y, and Y' be four tennis players, and let
their corresponding abilities be x, x', y, and y'. Also, let us assume that
playing abilities are matched in accordance with Property 3, so that x = x' and
y = y'. Now if we arrange a doubles match so that X and Y are paired against X'
and Y', it does not follow that the two teams will be evenly matched. Each
player interacts with his partner in certain ways, and one team may be
more mutually compatible than the other. Our conclusion, thus, is that the
playing ability of tennis players does not seem to constitute an interval scale.

One scale possessing interval properties is the Centigrade temperature scale.
The difference between 1°C and 2°C is equivalent to the difference between
85°C and 86°C. The equivalence may be established with reference to the
number of calories of heat required to change the temperature of a standard
substance. Thus, it takes one calorie to raise the temperature of water from
1° to 2°, and it also takes one calorie to raise its temperature from 85° to
86°. Consequently, through the simple calculation of temperature changes by
means of subtraction, we have a meaningful statement about the number of
calories required to produce the change in temperature. Furthermore, we will
find that all the above properties of interval scales hold for the Centigrade
scale. For example, let us assume that we have four jars of equal size: X,
X', Y, and Y'. Let each jar be half full of water, and let the temperature of
the water in the jars be 10°, 10°, 90°, and 90°C, respectively. These
temperatures may be designated as x, x', y, and y', and we thus have the
equalities specified by Property 3: x = x' and y = y'. We now pour the water
from Jar X into Jar Y, and that from Jar X' into Jar Y'. According to Property 3,
x + y = x' + y', i.e., the temperature of the water in the two jars (now full)
should be equivalent. This would in fact be the case, thus partially
corroborating the statement that the Centigrade scale of temperature possesses
interval properties.
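
The jar example can be expressed as a small calculation. The sketch assumes equal quantities of water and no loss of heat, and it models the combining operation as the average of the two temperatures; that modeling choice is ours, since the text states only that the two combined results should be equivalent. The function name mix is hypothetical.

```python
def mix(temp_a, temp_b):
    """Temperature after combining two equal quantities of water,
    assuming no heat is lost to the jars or the surroundings."""
    return (temp_a + temp_b) / 2


# Temperatures in degrees Centigrade of the water in Jars X, X', Y, and Y'.
x, x_prime, y, y_prime = 10.0, 10.0, 90.0, 90.0

jar_y = mix(x, y)                    # water from Jar X poured into Jar Y
jar_y_prime = mix(x_prime, y_prime)  # water from Jar X' poured into Jar Y'

# Property 3: equals combined with equals give equal results.
print(jar_y, jar_y_prime, jar_y == jar_y_prime)

# Constant unit: a one-degree difference is the same anywhere on the scale.
print((2 - 1) == (86 - 85))
```

Both full jars come out at the same temperature, which is all that Property 3 requires of the combining operation.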


The Method of Equal-Appearing Intervals

Perhaps the method best known in behavioral science for developing scales
with interval properties is the method of equal-appearing intervals.* Let us
assume that we would like to develop an interval scale for the value of
individualism.* This is one of the values that emerged from factor analysis (see
Section 11.1). The general procedures involved in the method of equal-appearing
intervals may be summarized as follows:


1. The researcher chooses the concept or variable he would like to measure,
in this case the value of individualism.

2. On the basis of his ideas about the concept, he constructs (or gathers)
many statements which seem to be closely related to the concept. In the
original description of the method, for example, 130 statements were
used in developing a scale for measuring attitude toward the church.

3. The individuals selected as judges* rate each statement with reference
to its degree of "favorableness" or "unfavorableness." This was
accomplished in the original study by giving each judge a set of
cards, each of which contained a separate statement. In addition, each


* L. L. Thurstone and E. J. Chave, The Measurement of Attitude (Chicago:
University of Chicago Press, 1929).

* . . . of Value Statements," Department of Social Relations, . . . Once a
factor analysis is performed and several factors are isolated . . . it may be
important to develop an interval scale of the . . . from the factor analysis.
It should be noted that the method of equal-appearing intervals is not
necessarily tied to that of factor analysis . . . very few of those interval
scales developed by this procedure were based on the initial results of a
factor analysis.

* The original study was based on 300 judges . . . data obtained from large
numbers of judges.




judge was given a set of eleven cards containing the letters A to K and
arranged in the following order:

Unfavorable   A  B  C  D  E  F  G  H  I  J  K   Favorable

Each judge is asked to sort the cards containing the statements into the
eleven piles represented by the lettered cards. Under Card A are to be
placed the statements expressing the most unfavorable feelings about
the concept being measured. Under Card F (the middle card) are to be
placed the neutral statements, while under Card K are to be placed the
most favorable statements. Under the other lettered cards are to be
placed the statements representing varying degrees of favorableness or
unfavorableness.

4. The ratings obtained from those judges who performed their task
carelessly are eliminated, together with those statements which receive
widely different ratings from different judges.

5. Each of the remaining statements is given a scale value corresponding
to the median or middle position to which it is assigned by the judges.
Here, Pile A is designated as having a score of 1; Pile B, a score of 2;
and the scores of the remaining piles increasing to a score of 11 for
Pile K.

6. A final selection of about 20 to 22 items is made with the aim of
obtaining a set of statements spread out evenly along the scale from one
extreme position to the other. Where a choice exists among a number
of statements with approximately the same scale values, preference is
given to the one obtaining the highest degree of agreement among the
judges. Thus, for example, an item placed into Piles D, E, F, G, and H
by equal numbers of judges would be rejected in favor of one placed
only into Piles E, F, and G by groups of equal size. At this time the
investigator often constructs a duplicate form of the scale, utilizing the
same criteria of equal spacing and giving preference to items with the
least dispersion among judges. This provides the investigator with the
means for testing the scale for reliability by correlating respondents'
scores on one scale with their scores on the duplicate form.

7. A hypothetical example of how some of these items might look, together
with hypothetical scale values, appears below. These items are then used
in research studies as a way of measuring the value of individualism for
a given sample of individuals. Thus, a scale value for a given respondent
is obtained by summing the scale values of all the items to which he
agrees, and then calculating his average or median scale value. Ideally,
individuals will agree only with items which are similar to one another.
If an individual agrees with statements that vary widely in scale value,
he may have quite ambiguous feelings with respect to the concept being
measured.


Scale
Value   Statement

 1.1    The suppression of one's own individual will in order to adopt the
        will of society is the way for man to achieve his noblest purposes.

 2.8    Man's true nature is best served when he follows the will of the
        majority.

 4.5    Disagreement with one's friends is often foolish because it tends to
        cause arguments.

 6.1    Self-expression is valuable, but it should not be unchecked if man is
        to enjoy the benefits of living in society.

 7.5    Man's development of his own capacities should be an important goal
        for him.

 8.9    Frequent conformity to the wishes of others constitutes a betrayal of
        man's own identity.

10.4    Man's own individual self-development to the limit of his capacities
        is the major purpose of his existence.
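
Steps 5 and 7 may be sketched as follows. The sketch is a simplification under stated assumptions: the scale value of a statement is taken as the plain median of the pile scores assigned by the judges (A = 1 through K = 11), the judges' placements are invented for the example, and the respondent's endorsed items are drawn from the hypothetical scale values in the table above. The function names are ours.

```python
from statistics import mean, median

# Pile A is scored 1, Pile B is scored 2, and so on up to Pile K, scored 11.
PILE_SCORES = {letter: score for score, letter in enumerate("ABCDEFGHIJK", start=1)}


def scale_value(placements):
    """Median pile score given to one statement by the judges (Step 5)."""
    return median(PILE_SCORES[pile] for pile in placements)


def respondent_score(endorsed_values, use_median=False):
    """Average or median scale value of the items a respondent agrees with (Step 7)."""
    return median(endorsed_values) if use_median else mean(endorsed_values)


# Hypothetical placements of a single statement by seven judges.
judge_placements = ["E", "F", "F", "G", "F", "E", "G"]
statement_value = scale_value(judge_placements)

# A hypothetical respondent who agrees with the three most individualistic items.
endorsed = [7.5, 8.9, 10.4]
average_value = respondent_score(endorsed)
median_value = respondent_score(endorsed, use_median=True)

print(statement_value, round(average_value, 2), median_value)
```

A respondent whose endorsed items lie close together, as here, yields an average and a median that nearly coincide; widely scattered endorsements would pull the two apart.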

Equal-Appearing Intervals, Guttman Scaling, and Summated Ratings:
A Comparison

In evaluating the method of equal-appearing intervals, it is useful to
compare it with other scaling procedures. One of the most important criteria for
evaluation involves the ordinal or interval properties of the scale produced.

It will be recalled that scaling was defined as a procedure that imparts some
of the characteristics of numbers to a given property of objects. This allows
the investigator to make use of these characteristics in developing and testing
his theory. With respect to the three procedures under consideration, it seems
that only the Guttman procedure provides definite criteria for deciding
whether or not a given scale has certain characteristics of numbers.

One might also say that the Guttman procedure is located mainly within
the context of justification whereas the others are primarily within the context
of discovery. The Guttman procedure describes how data may be assessed in
order to determine whether or not a scale exists, but it gives little direction
for the development of a set of items to test for scalability beyond
emphasizing the importance of conceptualizing or defining the universe of
content. The Likert and Thurstone procedures involve a set of directions that
take the researcher from the construction of items to their final selection for
the scale, but they provide no test within the context of justification for
ordinal or interval properties.

Because no provision is made for testing within the context of justification,
a given scale produced by the Likert or Thurstone method may or may not
have ordinal or interval properties. One would have to devise additional
procedures in order to test for these properties. We might, for example, start
by specifying that 90 percent of all respondents must agree only to those items
that are adjacent to one another . . . In this way, if we know the scale value
for a given respondent, we can predict fairly accurately those items with which
he will agree . . . shall be able to reject those scales that have somewhat
ambiguous . . . would be analogous to the rejection, in the Guttman . . .

. . . whose scale values, x and x', are equivalent . . . y'. We might define
the operation of addition within this context as a jointly agreed upon response
of the individuals involved. Thus, x + y . . . represent the scale value of
these joint responses. Then, assuming . . . individual within a pair has an
equal influence on the joint decision,* . . . determine whether or not it is in
fact true that x + y = x' + y'.

Closely related to the evaluation of scales in terms of the presence or
absence of interval and ordinal properties is what Guttman calls the
unidimensionality of the universe of content. If a series of items actually
measures several dimensions or properties at the same time, it usually will not
be possible to develop a Guttman scale indicating a partial order among
respondents, let alone develop an interval scale. Of the three procedures under
consideration, none provides a definitive test for unidimensionality, although
the Guttman technique goes farthest toward providing . . . the possible items
. . . constitute indices of the property in question. Thus, any set of items
. . . by a given investigator may not represent a fair sample from this universe
of content. Given this situation, the best that we can do is to devise criteria
that will enable us to tell whether the "unfair" sample of items under
consideration seems to be unitary. Evidence to this effect is obtained in the
Guttman scaling procedure whenever a high coefficient of reproducibility occurs,
indicating a simple order among the items. Such a simple order would generally
not appear if several dimensions were involved in the items. The Likert and
Thurstone procedures do not provide such a test, although the latter might be
modified so that a criterion (that a very high proportion of respondents must
agree only to items contiguous in scale value) might be utilized to accept or
reject a given scale. In this way, evidence would be provided as to the interval
characteristics of the items in a manner similar to that involved in the
Guttman scaling procedure.
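
One way such a criterion might be applied is sketched below. The 90 percent threshold follows the suggestion made earlier in this section; the scale values are the hypothetical ones from the individualism example, and the ten respondents are invented for the illustration. The function names are ours, and the sketch assumes that no two items share a scale value.

```python
# Hypothetical scale values from the individualism example.
SCALE_VALUES = [1.1, 2.8, 4.5, 6.1, 7.5, 8.9, 10.4]


def agrees_only_with_contiguous_items(endorsed, scale_values):
    """True when the endorsed items form one unbroken run in scale-value order."""
    ordered = sorted(scale_values)
    positions = sorted(ordered.index(value) for value in endorsed)
    return all(later - earlier == 1 for earlier, later in zip(positions, positions[1:]))


def passes_criterion(respondents, scale_values, threshold=0.90):
    """Accept the scale if at least `threshold` of respondents endorse only contiguous items."""
    consistent = sum(
        agrees_only_with_contiguous_items(endorsed, scale_values)
        for endorsed in respondents
    )
    return consistent / len(respondents) >= threshold


# Ten hypothetical respondents: nine endorse adjacent items, one does not.
respondents = [[7.5, 8.9, 10.4]] * 4 + [[4.5, 6.1]] * 3 + [[1.1, 2.8]] * 2 + [[1.1, 10.4]]

accepted = passes_criterion(respondents, SCALE_VALUES)
print(accepted)
```

Here nine of the ten respondents endorse only contiguous items, so the scale just meets the 90 percent criterion; one more inconsistent respondent would lead to its rejection.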

The three procedures might also be evaluated with reference to the
individuals to whom a given scale applies. In the Guttman and Likert procedures,
no separate group of judges is involved. Consequently, these methods are not
subject to the criticism that judges may see things differently from the actual
respondents. Of course, a scale developed for a given set of respondents by
any of the three procedures may not have the same properties for another set
of respondents. For example, a set of items scalable according to Guttman
criteria for one sample may not be so for another sample. As a matter of fact,
it may be true that each person's idea of a given concept differs to some extent
from everyone else's. Thus, it may be necessary to shift to the kind of
approach illustrated by Stephenson's Q-technique, where the responses of each
individual are analyzed separately.

* If this were actually undertaken, it would be desirable to match subjects on
the degree to which they make their influence felt in this kind of interpersonal
situation.

12.2 RATIO SCALES 

Ratio scales have all of the properties of nominal, ordinal, and interval
scales. In addition, however, a ratio scale possesses an absolute zero point,
so that multiplication and division become meaningful operations. One example
of a ratio scale is the Kelvin scale of temperature. The zero point for both
the Centigrade scale (the temperature at which water freezes) and the Fahrenheit
scale (the lowest temperature Fahrenheit could produce, that of a mixture
of snow and salt) is an arbitrary one. In the Kelvin scale, however,
temperature has a direct interpretation in terms of the motion of molecules,
and 0°K is that point at which there is no motion whatsoever. Only with this
type of scale is it meaningful to multiply or divide a given temperature. We
cannot say that 30°C, for example, is three times as warm as 10°C, even
though this scale does have interval properties. In the Kelvin scale, however,
30° and 10° are directly translatable into the energy of molecular motion, which
is three times as great at the former temperature as at the latter. Other
examples of ratio scales are measures of length, weight, and time intervals.
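
The contrast between interval and ratio properties can be checked with a few
lines of arithmetic. The following is a minimal sketch, assuming the standard
conversion of Centigrade to Kelvin (adding 273.15); the function name is
hypothetical and chosen only for this illustration.

```python
def celsius_to_kelvin(c):
    """Convert degrees Centigrade to kelvins (absolute zero is -273.15 C)."""
    return c + 273.15

warm_c, cool_c = 30.0, 10.0

# Naive ratio on the Centigrade scale: it depends on an arbitrary zero point.
naive_ratio = warm_c / cool_c

# Ratio on the Kelvin scale: meaningful, because 0 K is an absolute zero.
kelvin_ratio = celsius_to_kelvin(warm_c) / celsius_to_kelvin(cool_c)

# Differences (the interval property) are identical on both scales.
diff_c = warm_c - cool_c
diff_k = celsius_to_kelvin(warm_c) - celsius_to_kelvin(cool_c)

print(naive_ratio, round(kelvin_ratio, 3), diff_c, round(diff_k, 2))
```

The Centigrade readings suggest a ratio of 3, whereas the same two temperatures
expressed in kelvins stand in a ratio of only about 1.07; the difference of 20
degrees is the same either way.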

Ratio scales enable the investigator to state relationships among variables 
as products or ratios. (Of course, this is of no use unless it is theoretically 
important to state relationships in this way.) For example, we might state
that an individual's preference for a given event is a product of its utility 
for him and his expectations as to its likelihood. Thus, an individual may be 
asked his preference between being given $1.00 if the sun rises tomorrow 
or $50.00 if a penny turns up heads. Here, the financial rewards are related 
to the utilities of the alternatives, while the likelihood expectations involve 
the individual’s subjective feelings as to the probabilities of each. (This
formulation will be developed in some detail in Section 17.3.)
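
The product formulation can be sketched as follows. The numbers are
hypothetical: utilities are simply set equal to the dollar amounts and the
likelihoods are invented for one imaginary respondent, so the example shows
only the form of the calculation.

```python
def subjective_expected_utility(utility, likelihood):
    """Product of an outcome's utility and its subjective likelihood (0 to 1)."""
    return utility * likelihood

# Hypothetical values for one respondent; utilities are taken as equal to
# the dollar amounts purely for illustration.
sun_rises = subjective_expected_utility(utility=1.00, likelihood=0.99)
penny_heads = subjective_expected_utility(utility=50.00, likelihood=0.50)

# The alternative with the larger product is the one predicted to be preferred.
preferred = "penny" if penny_heads > sun_rises else "sun"
print(sun_rises, penny_heads, preferred)
```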

The proposition that one chooses between two events in this manner is quite 
similar to the expected value-deprivation hypothesis in the medical student 
study. There, individuals were seen to make those choices that seemed to lead
to a minimizing of expected value deprivation. 

The Measurement of Utility 

How are scales which possess ratio properties to be constructed? Galanter⁷
provides an illustration of this procedure for the measurement of both utility 

⁷ Eugene Galanter, “The Direct Measurement of Utility and Subjective Probability,”
American Journal of Psychology, LXXV: 208-20 (1962).




and subjective probability. His procedure for the measurement of utility is to
determine the relationship between utility and its monetary equivalent. The
strategy here is based on the view that money already constitutes a unidimensional
ratio scale. Thus, a determination of the number of utiles that correspond
to a unit of money might also give a utility scale ratio properties. The
technique involves giving a group of subjects these instructions:


Suppose I were to give you, as an outright gift, $10.00. This $10.00 does
not come from me, but comes from a foundation whose resources are limitless.
You will be taking nothing away from me when I give you this [gift].
Presumably, this will make you happy to a certain degree. Now, I want you
to think about how much money you would want in order to feel twice as
happy as the $10.00 would make you feel.⁸


Responses ranged from $20.00 to $100.00, with a median of $45.00. Thus,
for almost all the respondents, a doubling of “utility” involves more than a
mere doubling of the amount of money. The same procedure was repeated with
two other groups of subjects, except that the base amount of money was set
at $100.00 for the second group and $1000.00 for the third. The median
amount of money associated with a doubling of utility for these two groups
was $350.00 and $5000.00, respectively. On the basis of these results, a function
describing the relationship between utility and its monetary equivalent
was calculated.⁹
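
One way to see how such "twice as happy" judgments translate into a function is
sketched below. It assumes, for illustration only, that utility is a power
function of money, U = a * M**b; under that assumption a single doubling
judgment fixes the exponent b. The function name is hypothetical, and the two
data points are the medians quoted above for the $10.00 and $100.00 base
amounts.

```python
import math

def exponent_from_doubling(base_amount, doubling_amount):
    """Exponent b of U = a * M**b implied by one 'twice as happy' judgment.

    If U(doubling) = 2 * U(base), then (doubling / base) ** b = 2,
    so b = log 2 / log(doubling / base).
    """
    return math.log(2) / math.log(doubling_amount / base_amount)

# Median responses reported in the text for two of the base amounts.
exponents = {}
for base, median in [(10.0, 45.0), (100.0, 350.0)]:
    exponents[base] = exponent_from_doubling(base, median)
    print(base, round(exponents[base], 2))
```

Both implied exponents are well below 1, which is the formal counterpart of the
observation that doubling utility takes more than doubling the money.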


The Measurement of Subjective Probability

The measurement of subjective probability was accomplished in several
different ways. In one, subjects rated the likelihood of ten events, e.g., You
would survive an airplane crash; You can break a raw egg with a hammer.
One procedure required subjects to rate each event on a 100-point rating scale
from 0-100. Subjects were to assign a 0 to those events they were absolutely
certain would not occur, while 100 was to be assigned to events they were
absolutely certain would occur.

A similar rating procedure was utilized for another group of subjects, each 
of whom was asked to assign a number to each of a series of events. No 
limitation was placed on the size of the numbers, but subjects were asked to
represent the less likely events with smaller numbers and the more likely
events with larger ones. In addition, subjects were told to “assign the numbers
to the events so that the numbers are proportional to the likelihood.” Thus, 
an event which is twice as likely to occur as another event is to be given a 
number twice as large. This is the method of magnitude estimation. 

⁸ Galanter, op. cit., p. 211.

⁹ The function is U = 3.71 M^0.43, with U expressed in utiles and M in dollars.

The likelihood ratings obtained from these two procedures were generally
quite similar, once the ratings obtained from the method of magnitude estimation
were standardized so as to make their range comparable to that involved
in the 100-point rating scale. The difference between these procedures and
the method of equal-appearing intervals is substantial; in the former, no
intervening group of judges decides on the weights given to each item; the
respondents themselves determine these weights. In addition, the scales are not
limited to a range of eleven units; they have a much wider range. This feature is
especially important where items occur which, in the subject’s view, are very
much different from other items. The magnitude-estimation technique in
particular allows the subject as great a range as he desires. An additional
difference between the magnitude-estimation technique and that of equal-appearing
intervals is that in the former, subjects are directly encouraged to think in
terms of ratios.
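
The text does not say how the magnitude estimates were standardized. The sketch
below shows one simple possibility, offered as an assumption: each subject's
numbers are multiplied by a constant so that the largest becomes 100. Because
only a constant multiplier is used, the ratios among the estimates are left
untouched. The event labels and numbers are hypothetical.

```python
def rescale_to_100(estimates):
    """Linearly rescale magnitude estimates so the largest value becomes 100.

    Multiplying by a constant preserves the ratios among the estimates.
    """
    top = max(estimates.values())
    return {event: 100.0 * value / top for event, value in estimates.items()}

# Hypothetical magnitude estimates from one subject (any positive numbers).
raw = {
    "survive a plane crash": 2.0,
    "break an egg with a hammer": 400.0,
    "penny turns up heads": 200.0,
}
scaled = rescale_to_100(raw)
print(scaled)
```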

Assessment of Ratio Properties 

Galanter used an indirect procedure for testing his degree of success in the
construction of scales of utility and subjective probability. It is an example of 
a construct-validation procedure. His basic proposition is that the probability 
of choice depends only on the difference in the expected utilities of the alterna- 
tives. Subjective expected utility may be calculated by taking the product of 
the likelihood of the event and its utility, the two scales which have been 
developed. Assuming that this proposition is correct, we would be able to 
predict the probability of choice of any given event. The predicted probability 
of choice could then be compared with the actual probability of choice of the 
event. Agreement between the two would provide indirect evidence as to the 
successful achievement of ratio properties within the scales. 

In order to carry through this test, it is necessary to devise a procedure for 
measuring the actual probability of choice of a given event. The method de- 
vised had to do with a set of paired events, between which subjects were asked 
to choose — e.g., being given $1.00 if the sun rises tomorrow or $50.00 if a 
penny turns up heads. The actual probability of choice of a given event was 
then computed as the proportion of times that it was chosen over the alterna- 
tive choice within the sample. This was then compared with the predicted 
probability of choice of the event, based on the product of the likelihood of 
the event and its utility. Results indicated a close relationship between the 
actual and predicted probabilities for the set of events under consideration.
Two things are thus achieved simultaneously — evidence is obtained as to the 
achievement of ratio properties for the scales of utility and subjective
probability, and additional confirmation is provided for the proposition that the
probability of choice depends on the difference between the expected utility 
of the alternatives. 
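
The logic of this test can be sketched in a few lines. The text does not give
the function that converts a difference in expected utilities into a predicted
probability, so the sketch checks only the direction of the prediction: the
alternative with the larger product of likelihood and utility should be chosen
by more than half of the sample. All scale values and choices are hypothetical.

```python
def choice_proportion(choices, event):
    """Proportion of respondents in the sample who chose `event`."""
    return sum(1 for c in choices if c == event) / len(choices)

def seu(likelihood, utility):
    """Subjective expected utility: likelihood times utility."""
    return likelihood * utility

# Hypothetical scale values (likelihood on 0-1, utility in utiles).
events = {"A": seu(0.95, 4.0), "B": seu(0.50, 20.0)}
# Hypothetical choices made by ten respondents between A and B.
choices = ["B", "B", "A", "B", "B", "B", "A", "B", "B", "B"]

seu_difference = events["B"] - events["A"]
actual_b = choice_proportion(choices, "B")

# The proposition predicts B is chosen more often when its SEU is larger.
consistent = (seu_difference > 0) == (actual_b > 0.5)
print(round(seu_difference, 2), actual_b, consistent)
```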

The magnitude-estimation procedure for the construction of ratio scales
has recently been used to construct a scale of the relative gravity or
seriousness of delinquent acts. Subjects were asked to rate the seriousness of a
number of violations of the law. Some of the results of the study are:



The most strongly supported conclusion
is that all the raters, although constrained in their use of the magnitude
assignments, tended to so assign the magnitude differences [...] seriousness
of the crimes is evaluated in a similar way, without [...]
by all the groups. . . . Because of the inherent ratio quality of magnitude
judgments, the particular numbers used by the [...]
relevant; rather, it is the ratios of offense seriousness that [...]
intact.¹⁰


12.3 SOME SELECTED MEASUREMENT PROCEDURES 


Single-Item Measurement

Most of the measures used by behavioral scientists are single
items. For example, single items were utilized in an effort to measure
occupational values in the medical student study. The following
constitute some examples of these items, each of which was utilized
to measure a separate value:


What Importance Would Each of the Following Things Have
in Your Ideal Job?

                              Indispensable  Extremely  Very       Fairly     Little or No
                              in a Job       Important  Important  Important  Importance

Developing warm personal
relationships with patients

Having prestige among my
colleagues in the medical
profession

Having the chance to help
people

Having the chance [...]
continually my understanding
of basic [...]


¹⁰ Thorsten Sellin and Marvin E. Wolfgang, The Measurement of Delinquency (New
York: John Wiley and Sons, 1964), p. 268.


One advantage of single-item measurement is the speed with which it en- 
ables research to proceed. Rather than focus attention on the development of 
multiple item scales, the investigator is free to concentrate on the develop- 
ment and testing of a variety of propositions. The popularity of single-item 
measurement is thus readily understandable, for most investigators prefer to 
get through with the business of measurement as quickly as possible. One 
justification for this procedure is that, as a result of developing and testing 
propositions, the investigator may glean new ideas about the concept being 
measured and these may lead to substantial improvements in the measuring 
instrument. In addition, the investigator is free to measure a wider variety of
concepts during a given interview or questionnaire session. A Thurstone equal- 
appearing interval-scale, for example, would require a total of 20 to 22 items 
for each concept scaled in this manner. The investigator must also take into 
account the factor of the boredom of the respondent; this was a frequent reac- 
tion by respondents to those parts of one interview schedule designed to 
furnish the raw materials for a number of Guttman-scale analyses.¹¹

A final argument which may be put forward in defense of single-item meas- 
urement is that it may not be inferior to multiple-item measurement for many 
research situations. For example, in the medical student study, single items 
from the structured-values question were used to measure the occupational 
values of respondents. An additional procedure for measuring these values 
used responses to the careers-value question. Each value was measured by 
three separate items, and a Guttman-type scale was developed for each set
of items.¹² It was then found that the Guttman-type scales led to almost
exactly the same results as the single-item measures when it came to predicting
the respondents’ preferences among medical fields. This does not mean that
the scaling procedure was completely useless in this instance, for it provided
some evidence for the unidimensionality of the different values that formed
scales. 

The disadvantages that may be attached to the use of single-item measurement
are numerous, although they are not unique to this type of measurement.
Every item in a questionnaire must measure something, but it may also get 
at several things at the same time. A given item may mean quite different 
things to different respondents, or one thing to one grouping of respondents 
and something else to another. Also, the different response categories may
vary in meaning among respondents. One person’s notion of what is indis- 
pensable in a job may be equivalent to another person's conception of what 
is extremely important in a job. 

Some of these difficulties may be at least partially resolved by certain 
multiple-item techniques. Thus, for example, Guttman scaling provides some

¹¹ Bernard S. Phillips, Technical Materials for an Interview Survey of the Aging
(Urbana, Ill.: Small Homes Council-Building Research Council, 1962).

¹² Bernard S. Phillips, “Expected Value-Deprivation and Occupational Preference,”
Sociometry, XXVII: 151-60 (1964).



evidence for the unidimensionality of the concept for [...]. One
generally would not be able to obtain a coefficient of reproducibility of [...] or
higher unless a fairly high degree of uniformity existed among the respondents
as to the meaning of each item, as well as to the different response
categories for each item.

In addition, multiple-item measurements are generally more reliable than
single-item techniques. The question of validity, however, is somewhat
complex. On the one hand, the magnitude-estimation procedure used by Galanter
seems to have resulted in some validation of a multiple-item scale
measure of subjective probability. The greater the number of items in a
scale, however, the greater is their potential for measuring more than one
factor for a set of respondents.
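
A coefficient of reproducibility can be computed along the following lines. The
sketch assumes dichotomous (agree or disagree) items and counts an error
wherever a response departs from the ideal cumulative pattern implied by the
respondent's total score; this is one of several error-counting conventions,
and the function name and data are hypothetical.

```python
def coefficient_of_reproducibility(responses):
    """1 - errors / (respondents x items) for rows of 0/1 item responses."""
    n_resp = len(responses)
    n_items = len(responses[0])
    # Order items from most to least frequently endorsed.
    totals = [sum(row[j] for row in responses) for j in range(n_items)]
    order = sorted(range(n_items), key=lambda j: -totals[j])
    errors = 0
    for row in responses:
        score = sum(row)
        # Ideal pattern: endorse exactly the `score` most popular items.
        ideal = [1 if rank < score else 0 for rank in range(n_items)]
        actual = [row[j] for j in order]
        errors += sum(a != b for a, b in zip(actual, ideal))
    return 1 - errors / (n_resp * n_items)

# Hypothetical data: four perfectly cumulative respondents and one deviant.
perfect = [[1, 1, 1], [1, 1, 0], [1, 0, 0], [0, 0, 0]]
with_deviant = perfect + [[0, 0, 1]]
print(coefficient_of_reproducibility(perfect))
print(round(coefficient_of_reproducibility(with_deviant), 3))
```

A perfectly cumulative set of responses yields a coefficient of 1; the single
respondent who endorses only the least popular item contributes two errors and
lowers it.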


Semantic Differential 

A method that attempts to get at the underlying meaning which a given
concept has for an individual or group has recently been developed by Osgood,
Suci, and Tannenbaum.¹³ Subjects are asked to rate a given concept (e.g.,
surgeon) by locating it between two polar adjectives. The space between the
adjectives is divided into seven units, indicating closeness of meaning to each
of the adjectives. One of the questions utilizing the semantic-differential
approach in the medical study was the following:


                           SURGEON

Colorful      ___:___:___:___:___:___:___  Colorless
Feminine      ___:___:___:___:___:___:___  Masculine
Outgoing      ___:___:___:___:___:___:___  Self-centered
Awful         ___:___:___:___:___:___:___  Nice
Hard          ___:___:___:___:___:___:___  Soft
Ordinary      ___:___:___:___:___:___:___  Individualistic
Introverted   ___:___:___:___:___:___:___  Extroverted
Pleasant      ___:___:___:___:___:___:___  Unpleasant


¹³ Charles E. Osgood, George J. Suci, and Percy H. Tannenbaum, The Measurement
of Meaning (Urbana, Ill.: University of Illinois Press, 1957).


This technique may be used to compare the meanings the different medical
fields have for respondents. For example, it might provide clues as to
why some individuals rated general practice favorably in spite of a high
degree of expected value-deprivation for that field. The analytic procedures
involved in such comparisons are quite simple. In this instance one might
compare those medical students favorable to general practice and high on
expected value deprivation for the field with students who are unfavorable to
general practice and are also high on expected value deprivation for that field.
A difference between two individuals on a given pair of polar adjectives is
measured by the number of categories by which their ratings differ. The maximum
difference is 6 (because there are only 7 categories) and the minimum
difference is 0. One group’s ratings may be compared with those of another
by comparing the medians of the two.
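
The two comparisons just described can be sketched as follows, assuming ratings
are coded 1 through 7 from the left-hand to the right-hand adjective. The group
ratings are hypothetical.

```python
from statistics import median

def rating_difference(rating_a, rating_b):
    """Number of categories (0 to 6) separating two ratings on a 1-7 scale."""
    return abs(rating_a - rating_b)

# Hypothetical ratings of "general practice" on one adjective pair (1-7).
favorable_group = [6, 7, 5, 6, 6]
unfavorable_group = [3, 2, 4, 3, 5]

largest_possible = rating_difference(7, 1)
group_gap = rating_difference(median(favorable_group), median(unfavorable_group))

print(largest_possible)
print(median(favorable_group), median(unfavorable_group), group_gap)
```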

Osgood and his co-workers, on the basis of a series of factor analyses, have 
emphasized that the meaning of concepts for respondents seems to involve 
evaluation, potency, and activity. Individuals tend to judge or evaluate con- 
cepts according to such polar adjectives as good-bad, nice-awful, and beautiful- 
>tgly. Potency may be indicated by strong-weak, large-small, and heavy-light, 
whereas activity may be indicated by fast-slow, active-passive, and sharp-dull. 
In the question utilizing the semantic differential, the adjectives awful-nice 
and pleasant-unpleasant cefiect evaluation; feminine-masculine and hard-soft 
reflect potency. Factors other than evaluation, potency, and activity were 
isolated by Osgood and his associates, and one of these is receptivity, meas- 
ured by colorful<olorless, or (in other cases) sensitive-insensitive, savory- 
tasteless. and interesting-boring. The remaining pairs utilized in the example 
(ouigoing-self<entered, ordinary-individualistic, and introverted-extroverted) 
were thought to be of special interest in the problem under investigation. AH 
of them concern sociability, a variable that seemed to be of some importance 
in trying to understand why students might or might not be interested in vari- 
ous medical fields. 
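
When several adjective pairs are taken to reflect the same factor, their
ratings may be combined into a single score. The sketch below is one
hypothetical way of doing so. It assumes ratings coded 1 through 7 from left to
right, and it reverse-scores a pair whenever the left-hand adjective is treated
as the high pole; which pole counts as high is an assumption made here, not a
rule taken from the study.

```python
from statistics import mean

# Each pair: (left adjective, right adjective, factor, reverse).
# `reverse` is True when the left-hand adjective is treated as the "high"
# pole, so the rating is flipped to make larger numbers mean more of the
# factor. The choice of high pole is an assumption made for this sketch.
PAIRS = [
    ("awful", "nice", "evaluation", False),
    ("pleasant", "unpleasant", "evaluation", True),
    ("feminine", "masculine", "potency", False),
    ("hard", "soft", "potency", True),
]

def factor_scores(ratings):
    """Average the 1-7 ratings within each factor after reverse-scoring."""
    by_factor = {}
    for (left, right, factor, reverse), rating in zip(PAIRS, ratings):
        value = 8 - rating if reverse else rating
        by_factor.setdefault(factor, []).append(value)
    return {factor: mean(values) for factor, values in by_factor.items()}

# Hypothetical ratings of one concept, in the same order as PAIRS.
scores = factor_scores([6, 2, 5, 3])
print(scores)
```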

The semantic differential seems to provide a path toward a better understanding
both of the measurement situation within the research process and of
the individual’s fundamental expectations as to the meanings of concepts.
Within the research process, subjects or respondents are often bombarded
by a variety of verbal stimuli, and it is often assumed that the meanings
attached to these stimuli are relatively uniform. Such an assumption can
be tested by the semantic-differential approach, and the meanings assigned by
a given individual may be differentiated from those assigned by any other.
The medical student study provides an example of the use of this method as
a means of getting at the individual's fundamental expectations. The method
itself is more general than the specific factors isolated in the initial studies.
It promises a far more thorough and detailed understanding of the symbolic
aspects of human behavior than has yet been attempted or attained.



Social Relationships 

Some specialized techniques bear on concepts of crucial importance to
behavioral science: social relationships and expectations. One of these
techniques, the sociometric test,¹⁴ is concerned with social
relationships. An example of this approach is provided by the medical
student study. Some of the questions administered to the students were:


1. Whom do you like best? 

2. Who knows most about medicine?

3. Whom would you like most to represent medical students (at some
meeting)?

4. To whom do you talk most about your plans for the future?

5. Who knows best where he is going? 

6. Who has the widest range of interests? 

7. To whom would you go for advice on a personal problem? 

8. Who will make the best doctor? 


These questions were designed to aid in the assessment of medical students’
influences on one another with respect to preferences for the various medical
fields. One of the first steps was to find out whether the preference patterns
of more popular students differed from those of students who were less popular.
Students were divided into popularity groups on the basis of how often
their names were mentioned by other students in response to Questions [...]
and 7 above, the items that seemed to provide the best indications of actual
social relationships among students. Results indicated that within the
high-popularity group the preference patterns of freshmen, sophomores, juniors,
and seniors were similar. Among the low-popularity group, however, an
important change in preference patterns occurred between the freshman and
junior years: freshmen tended to agree very little on their preferences for the
medical fields, but this degree of agreement increased during the sophomore
and junior years. The inference was drawn that these low-popularity students
are exposed to less peer-group influence than the more popular ones, and
consequently are slower to reach a common standard of agreement on their
preferences.
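
The division into popularity groups can be sketched as a simple tally of
nominations. The names, the nominations, and the cutoff below are all
hypothetical; the study's actual cutoff is not given in the text.

```python
from collections import Counter

def popularity_counts(nominations):
    """Count how often each student is named by others across questions."""
    counts = Counter()
    for chooser, named in nominations:
        if named != chooser:          # ignore self-nominations
            counts[named] += 1
    return counts

def split_by_popularity(counts, students, cutoff):
    """Divide students into high and low groups at a chosen cutoff."""
    high = sorted(s for s in students if counts[s] >= cutoff)
    low = sorted(s for s in students if counts[s] < cutoff)
    return high, low

# Hypothetical (chooser, person named) pairs pooled over several questions.
nominations = [("ann", "bob"), ("cal", "bob"), ("dee", "bob"),
               ("bob", "ann"), ("cal", "ann"), ("dee", "cal")]
students = ["ann", "bob", "cal", "dee"]

counts = popularity_counts(nominations)
high, low = split_by_popularity(counts, students, cutoff=2)
print(high, low)
```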

This analysis represents only a fragment of the vast amount of data pro- 
vided by this one medical school on student interrelationships. For example, 
the patterns of social relationships of a given student have yet to be related

¹⁴ For a discussion of the origin of this technique, see J. Moreno, Who Shall Survive?
(Washington, D.C.: Nervous and Mental Disease Publishing Co., 1934). For a more
recent discussion see Gardner Lindzey and Edgar F. Borgatta, “Sociometric
Measurement,” in Gardner Lindzey (ed.), Handbook of Social Psychology, Vol. I
(Cambridge, Mass.: Addison-Wesley Publishing Co., 1954), pp. 405-48.



to his values and expectations. Such analyses may yield a greater understanding
of the dynamics involved in choice of medical field.

An approach that may be utilized in conjunction with an analysis of social 
relationships is agreement analysis,¹⁵ a technique that requires an electronic
computer. It is possible to determine with which persons a given
individual is in agreement on a series of items. For example, the voting pat- 
terns of United States senators may be analyzed in this way. The result will 
be a number of partially overlapping groupings of senators who voted simi- 
larly on a large number of issues. Each grouping constitutes a type, and 
research may then proceed as to the origin and impact of such types. Many 
senators will appear in a number of types, but each type will be based on a
different set of votes. The set of votes corresponding to a given type may be
called the pattern. This kind of basic data on types and patterns can then be 
combined with an analysis of the patterns of social interaction among senators 
for a more complete understanding of the dynamics of social interaction. 
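
The basic counting step behind this kind of analysis can be sketched as
follows. This is not McQuitty's full procedure, which builds types
hierarchically; it is a simplified illustration that counts, for every pair of
persons, the number of items on which they agree and then lists the pairs
meeting a chosen minimum. The senators and votes are hypothetical.

```python
from itertools import combinations

def agreement_counts(votes):
    """For every pair of persons, count the items on which they agree."""
    result = {}
    for a, b in combinations(sorted(votes), 2):
        result[(a, b)] = sum(x == y for x, y in zip(votes[a], votes[b]))
    return result

def agreeing_pairs(votes, minimum):
    """Pairs who agree on at least `minimum` items (a crude 'type')."""
    return [pair for pair, n in agreement_counts(votes).items() if n >= minimum]

# Hypothetical roll-call votes (Y/N) of four senators on five issues.
votes = {
    "S1": "YYNYN",
    "S2": "YYNYY",
    "S3": "NNYNY",
    "S4": "NNYNN",
}
pairs = agreeing_pairs(votes, minimum=4)
print(pairs)
```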

Expectations 

A theoretical analysis of the relationship among expectations, norms, and 
roles provided by Rommetveit¹⁶ is based on the idea that expectations are
both “sent” and “received” by individuals. Thus, ego expects alter to do X
(sent expectation), and alter understands what ego expects (received
expectations). Most expectations are supported by direct or indirect rewards or
punishments. A given individual may or may not “receive” whatever was
“sent.” In any case, he acts on the basis of whatever it is that he “receives.”
His behavior or choices may or may not conform to these expectations, de- 
pending on the strength of the attendant rewards and punishments. When 
expectations are recurrent, we may speak of a social norm, and a social role 
may be thought of as a system of norms directed at a given category of indi- 
viduals. 

Within this theoretical framework, it would be desirable to obtain three 
pieces of information: the expectations a given individual sends out, those he 
receives, and his behavior in the context of whatever expectations he receives. 
An example of how these different kinds of data are obtainable is provided
by the structured-values question in the medical student study. Within this
context, the expectations we are dealing with involve the fundamental
occupational goals or values which a medical student should or should not have. The
three pieces of data may be obtained in the following ways: 

1. Expectations received. The student can be asked directly about the way
   other medical students would expect him to answer the series of questions
   on occupational values. But because in our society, with its emphasis
   on individualism and free choice, many individuals are reluctant
   to admit being subject to such expectations, a more effective method
   is simply to inquire about a given respondent’s estimate of the
   importance attached by other medical students in his class to these
   values. Whichever approach is used, it is often quite difficult to determine
   exactly what kinds of sanctions are employed to back up these expectations,
   for they may be quite subtle.

2. Expectations sent. A direct method might involve questions on how the
   respondent would expect other medical students in his class to rate the
   importance of the various values. The expectations sent by the entire
   medical student class could then be calculated by taking the average of
   those sent by each member of the class. An indirect approach might
   involve obtaining each student’s rating of the importance he
   attaches to each value, and then using these data for all students to
   infer the expectations sent out by the medical student class.

3. Actual behavior. The actual behavior or choice of a given student with
   respect to the importance of a set of occupational values is
   simply his own ratings of the importance of these values. This can then
   be analyzed in relation to the expectations sent and those received.

¹⁵ L. L. McQuitty, “Agreement Analysis: Classifying Persons by Predominant
Patterns of Responses,” British Journal of Statistical Psychology, IX: 5-16 (1956).

¹⁶ Ragnar Rommetveit, Social Norms and Roles (Minneapolis: University of
Minnesota Press, 1955).


The entire indirect approach can be managed in a single question with
several parts. The following is a format developed for the medical student
study which can be used to obtain all three types of data:


Most people have some idea of what they would want in an ideal job
(i.e., if they could dream up a job that had all the things they like). What
importance would each of the following things have in your ideal job? Put
a check in Column A, B, C, D, or E.

Aside from referring to things that may or may not be important to you
personally, these statements may refer to things that are important to other
students in medical school. What importance do you think these things generally
have to other students in your class? Put a check in Column F, G, or H.

You will work better on this question if you work first on Column A
through E on all questions and then come back to work on questions F
through H.

It should be noted that in this approach the investigator may select any 
grouping he believes to have an influence on the behavior of the individuals 
under investigation. In this case, the medical school class was used as the 
relevant grouping, but research might equally well proceed with respect to
the student’s friends within medical school. These may be determined on the 
basis of sociometric analysis, and their responses might then be utilized to 
measure the expectations sent. 
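
The indirect approach can be sketched as follows. For simplicity the sketch
assumes that a student's own ratings (Columns A through E) and his ratings of
the class (Columns F through H) have been recoded onto one common numerical
scale; that recoding, the function names, and the data are assumptions made for
illustration.

```python
from statistics import mean

def expectations_sent(own_ratings_by_student):
    """Indirect estimate: average of every student's own rating of a value."""
    return mean(own_ratings_by_student.values())

def gap(a, b):
    """Absolute distance between two ratings on the same scale."""
    return abs(a - b)

# Hypothetical ratings of one occupational value on a common 1-5 scale.
own = {"s1": 5, "s2": 4, "s3": 4, "s4": 3}        # the student's own rating
perceived = {"s1": 4, "s2": 4, "s3": 2, "s4": 3}  # his rating of the class

sent = expectations_sent(own)
for student in own:
    received = perceived[student]   # expectations received
    behavior = own[student]         # actual behavior (own rating)
    # How far perception departs from what the class actually sends, and
    # how far the student's own rating departs from what he perceives.
    print(student, gap(received, sent), gap(behavior, received))
```

The same functions apply unchanged if the relevant grouping is a student's
sociometrically determined friends rather than the whole class; only the
contents of the `own` dictionary would differ.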






SELECTED REFERENCES FOR FURTHER STUDY 

Charles E. Osgood, George J. Suci, and Percy H. Tannenbaum, The Measurement
of Meaning (Urbana, Ill.: University of Illinois Press, 1957). This book
is the result of an attempt to develop an objective measure of meaning. The
semantic-differential technique, the outcome of these efforts, is discussed in
terms of the general theory on which it is based, the specific procedures involved,
and a variety of problems to which it has been applied.




Thorsten Sellin and Marvin E. Wolfgang, The Measurement of Delinquency
(New York: John Wiley and Sons, 1964). The research reported here represents
one of the very few applications of the technique of magnitude estimation
to sociological problems in the effort to construct a ratio scale. There is a good
deal of discussion of the general rationale for this technique, although most of 
the volume deals with the specific problem at hand. 

Warren S. Torgerson, Theory and Methods of Scaling (New York: John Wiley
and Sons, 1958). This rather technical volume would be of use primarily to the 
more advanced student of methods who has some sophistication in mathematics. 
A very wide range of scaling procedures is discussed and evaluated. 





But the consensus of the scientific community in a given instance may be
wrong. One of the most important safeguards against a tyranny of the majority
is found in the presentation of scientific evidence. Good [...]
require that the scientist make clear the decision-making process by
which evidence is weighed and conclusions drawn. In this way, other scientists
can look at the results of any particular investigation and separate the
data from the conclusions drawn by the investigator. Consequently, any
scientist is free to give different weight to the evidence and draw different
conclusions, as long as he, in turn, makes clear the process by which he
arrives at a differential assessment of a particular set of data. As a matter of
fact, such differences are often regarded as quite fruitful, for they may point
up the need for an additional study of a particular type in order to resolve
the controversy.

Chapter 13 discusses some of the procedures which have been developed
for the non-statistical analysis and presentation of data about one and two
variables. Chapters 14 and 15 outline some of the basic principles of statistics
and present some illustrative statistical procedures. The aim here is to
emphasize the fundamentals of probability and statistical decision making. Chapter
16 deals with the analysis of more than two variables within the context of
the search for cause-and-effect relationships.





One and Two Variables 


13.1 ONE VARIABLE 

Descriptive and Cause-Effect Analyses 

One-variable analysis describes a given entity by taking into account
only one variable at a time. Thus, a number of variables may be involved so
long as the investigator does not interrelate one with another. What is achieved
in this way is an initial description of the characteristics of the entity under
consideration. For example, let us assume that a sample of individuals is
drawn from a given community. Some of the facts resulting from an initial
one-variable or descriptive analysis might be the following:

1. Sex: 55 percent, females; 45 percent, males.

2. Race: 87 percent, white; 13 percent, Negro.

3. Age: 40 percent, 0-20; 31 percent, 21-40; 29 percent, 41 or over.

4. Religion: 35 percent, Protestant; 44 percent, Catholic; 21 percent,
Jewish.

5. Income: 33 percent, under $4000; 28 percent, $4000-$6999; 26 percent,
$7000-$9999; 13 percent, $10,000 or over.

6. Education: 88 percent, high school or less; 12 percent, beyond high
school.

Although six variables are described above, none is interrelated or cross- 
tabulated with any other by the researcher. Thus, a description of the sample 
with respect to sex, race, age, religion, income, and education is provided, 
but we do not know, for example, what proportion of males are white or what
proportion of females are high school graduates. Such descriptive analyses
are valuable in the social sciences chiefly as a prelude to two-variable and 
multivariate analysis. This emphasis, in part, explains why the term hypothesis 
is used primarily in the context of statements about the relationship between 
two or more variables. It is conceivable to use the term hypothesis in the 




context of one variable, but such usage is rare. For example, one might
have hypothesized that Johnson would win over Goldwater in the 1964
Presidential election. The single variable here is the vote, which
is categorized under the different candidates. But the social scientist is
generally not interested in doing research on hypotheses that hold little promise
of leading to an understanding of the mechanisms of human behavior. What
will happen in a particular place at a particular time may be of intrinsic interest
to many individuals, but the social scientist centers his interest on hypotheses
with far wider scope.

This does not mean that social scientists are uninterested in what happens
in American presidential elections. Such interest is frequently found, for
example, among political scientists, sociologists, and psychologists. Nevertheless,
interest centers more on the question of why a given event has happened,
is happening, or will happen than on what happens. Answers to such questions
constitute explanations that may shed considerable light on the dynamics
of behavior. In studying a given presidential election, for example, we may learn
a great deal about the general mechanisms which are involved in
the political expectations and values of human beings. Such information
might follow in the wake of such a question as “Why did the individuals who
voted for Goldwater do so?” The mere fact that a certain proportion of
Americans voted for a given candidate in a given election does not go very
far toward explaining human behavior in general.

Of course, we have to start somewhere, and description is viewed as an
important beginning. We cannot proceed to study why a given individual voted
for Goldwater unless we know whether or not he actually did vote for
Goldwater. Descriptive data constitute an important prelude to
cause-and-effect analyses. If we are ignorant of the characteristics of a
given population, we may have considerable difficulty in formulating
hypotheses about its behavior. A knowledge of its characteristics with
respect to sex, race, age, religion, income, and education, for example, may
be highly suggestive.

In a survey, a choice usually does not have to be made between a descriptive
analysis and a cause-and-effect analysis. A typical procedure in survey
analysis involves obtaining the “marginals,” or frequency distribution of the
respondents on each question, and then proceeding to two-variable and
multivariate analyses. Thus, for example, the number of individuals of each
sex, religion, and so on, would be tabulated. This would usually be followed
by a percentage breakdown for each of the questions (to determine the
proportions of male and female, and so on). Once this is accomplished, the
results would be used as an aid to the development of a cause-and-effect
analysis.
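
The tabulation of marginals just described can be sketched in a few lines of
Python. This is only an illustration: the function name `marginals` and the
five answers are hypothetical and are not data from any survey discussed in
this chapter.

```python
from collections import Counter


def marginals(responses):
    """Return {category: (frequency, percentage)} for one survey question."""
    counts = Counter(responses)
    total = sum(counts.values())
    # Percentages are rounded to whole numbers, as in the text's tables.
    return {cat: (n, round(100 * n / total)) for cat, n in counts.items()}


# Hypothetical answers to a single question (illustrative only).
sex = ["male", "female", "female", "male", "female"]
table = marginals(sex)

for category, (freq, pct) in sorted(table.items()):
    print(f"{category:<8}{freq:>4}{pct:>5}%")
```

The frequencies come first and the percentage breakdown second, which mirrors
the order of steps in the procedure; a two-variable analysis would begin only
after such a table exists for each question.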

With the advent of electronic data processing, such descriptive analyses
take very little time. The most time-consuming task is not the processing of
data by machines, but the planning procedures of the investigator as well as
his analysis and interpretation of the processed data. Thus, all the
frequencies, as well as the percentages, in response to each of the questions
in a detailed survey of a large sample can be calculated and printed by an
electronic computer in an hour or so.

Historical Studies 

The distinction between descriptive one-variable analyses and
cause-and-effect analyses may also be illustrated in a historical context.
Weber’s Protestant Ethic and the Spirit of Capitalism¹ is concerned with a
historical fact: the origin of capitalism. His focus is not, however, on a
description of the various pieces of evidence for the existence of capitalism
at different periods in different countries; rather it is the identification
of those factors or variables which seemed to be associated with the
existence of capitalism. What is involved here is the relationship among two
or more variables, one of which has to do with evidence for the existence of
capitalism; e.g., a relationship is established between one variable (the
Protestant ethic) and another (the accumulation of capital). A descriptive
analysis would deal with each of these variables separately. It is quite
different, however, to interrelate the two variables, that is, to compare
individuals who believe in the Protestant ethic with those who do not in
terms of their degree of accumulation of capital.

The question might be raised whether a historical fact such as the rise of
capitalism should be of interest to the scientist. Is it not true that such
an occurrence was relatively unique and is not likely to be repeated? If so,
of what possible utility can an understanding of this occurrence be to the
scientist who is interested in understandings that are not very limited in
time and space? In answering this question, it is important to understand
that in one sense every event is unique to some extent: none will ever be
repeated in exactly the same way or in the same context. But there is also
similarity or correspondence among events, and one of the tasks of the
scientist is to explore these similarities. In particular, although
capitalism may never rise again in the same way, certain similar values will
in all probability continue to affect the personal decisions of individuals
as well as the economy of nations in every era.

¹ Max Weber, Protestant Ethic and the Spirit of Capitalism, translated by
Talcott Parsons (London: George Allen and Unwin, 1930).


13.2 QUALITATIVE ANALYSIS: CONCEPTUALIZATION
OF VARIABLES

Migration to Skopje: An Example of Coding

Whereas factor analysis constitutes a technique for developing concepts,
a technique fairly involved mathematically, coding procedures for open-ended
questions are qualitative in nature and have the same purpose. The present
illustration is taken from an interview survey conducted in Skopje during
the summer of 1963. The study examined the impact of migration from a rural
to an urban area.² The specific survey question to be used for illustrative
purposes is: “What do you think was the biggest change in your life as a
result of your moving to Skopje?” The 284 individuals who were interviewed
had all come to Skopje since World War II as part of an enormous migration
that tripled the population of the city.

Respondents tended to be quite vocal in answering the question, and their
answers, needless to say, varied a good deal. A primary task for the
investigators was to organize or systematize the answers in such a way that
use could be made of this information.

The open-ended question was conceived of as functioning primarily in the
context of discovery rather than in the context of justification. In this
situation one does not start with a fixed set of concepts; rather, one
examines responses to develop concepts that seem to be fruitful. The
situation here is similar to that portion of the medical student study when
the students were asked to indicate what they liked most and least about
working in a number of medical fields. Their responses seemed to fall
naturally into a number of categories, and these were subsequently utilized
in the formulation of closed questions on the basic goals or values taken
into account by medical students when choosing a field of medicine.

A first step in the coding of responses to open-ended questions, then, is
the formation of a number of non-overlapping categories into which all of
the responses may be placed. Alternatively, we may say that the investigator
develops a nominal scale for the responses, since nominal scales are sets of
non-overlapping categories. The subsequent step in the medical student study
was the attempt to develop a series of interval scales, each one based on a
given category of the nominal scale. As we shall see, in the Skopje study
the nominal scale provided the basis for the development of a single new
concept which utilized aspects from all of the categories within the nominal
scale.

The categories of response that emerged from the answers to the question
“What do you think was the biggest change in your life as a result of your
moving to Skopje?” were (1) standard of living, (2) social, (3) cultural,
(4) intellectual, (5) occupational, (6) activities, (7) urban atmosphere,
(8) no special changes, (9) negative changes, and (10) personality changes.
Some of these are illustrated below.
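
The bookkeeping side of such coding can be sketched as follows. The ten
category names are those listed above; everything else (the function
`tally_codes`, the respondent identifiers, the rule of one code per
respondent, and the three coding decisions) is a hypothetical illustration
and not the procedure actually followed in the Skopje study.

```python
from collections import Counter

# The ten categories named in the text.
CATEGORIES = [
    "standard of living", "social", "cultural", "intellectual",
    "occupational", "activities", "urban atmosphere",
    "no special changes", "negative changes", "personality changes",
]


def tally_codes(coded):
    """Count responses per category of the nominal scale.

    `coded` maps a respondent id to the single category assigned by a coder.
    Because the mapping holds one code per respondent, the categories cannot
    overlap; the check below ensures every code belongs to the scale.
    """
    unknown = {c for c in coded.values() if c not in CATEGORIES}
    if unknown:
        raise ValueError(f"codes outside the scale: {sorted(unknown)}")
    counts = Counter(coded.values())
    return {cat: counts.get(cat, 0) for cat in CATEGORIES}


# Hypothetical coding decisions for three answers quoted in this section.
coded = {
    "r1": "standard of living",  # "I bought a refrigerator and an electric stove"
    "r2": "occupational",        # "I never have to use the hoe again"
    "r3": "urban atmosphere",    # "Now I live in a bigger and much livelier town"
}
frequencies = tally_codes(coded)
```

The judgment of which category fits a given answer remains the coder's; the
sketch only records those judgments and guards against codes that fall
outside the scale.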

1. Standard of living. Many of the responses were general with respect to
improvement in the standard of living, e.g., “Life is better here”; “There
is a higher standard of living and better salary”; “Betterment of my
material conditions”; “Comfortable life.” Increased income was sometimes
cited, but responses tended to focus on actual changes in consumption
patterns. Housing constitutes the item most frequently cited, e.g., “A larger
flat and better housing conditions”; “I bought a house of my own, and now
there is enough room for the whole family.” A variety of other goods and
services was mentioned, such as “I have more access to hospitals, medicines,
doctors”; “I bought a refrigerator and an electric stove”; “We eat better”;
“I started to dress better”; “I started to devote myself to luxurious
things.”

² Collaborating with the author in the Skopje study were D. Miljovska and
V. Taneski of the University of Skopje, and Irwin Sanders of the Ford
Foundation. The study is described in “The Urban Personality and the Rural
Personality,” Department of Sociology and Anthropology, Boston University
(mimeo).

2. Cultural. A marked change for the better in cultural life is noted by
many of the respondents, with particular emphasis on variety: “I found
myself in an atmosphere I was fond of and wanted to live in. Now I go more
often to exhibitions, concerts, to the theater, cinema. I have a more
heterogeneous life”; “We stay at home more. I had the opportunity to read
books and to reach a higher cultural level”; “Now I again have the chance to
go to concerts, exhibitions, theaters”; “I have widened my cultural and
entertainment life”; “I simply have a considerably more beautiful and better
life here than in a smaller place.”

3. Occupational. The relief from hard physical labor and long hours, with
the consequent opportunity for increased leisure-time activities, was often
noted: “There is relief from hard manual work we had at Radoish”; “I need
not walk 15 kilometers to my house now”; “I never have to use the hoe
again”; “The work here is easier than in the village”; “I do not go to the
field now”; “When I have time to spare I go to the cinema”; “I have more
leisure time.” A number of them mentioned being no longer unemployed. Other
comments emphasized the more challenging nature of the new jobs and
increased interest in work: “I found a job that corresponds to my ability”;
“I am improving the quality of my craft”; “I work with great enthusiasm
because it suits me”; “I advanced in rank and received a higher function”;
“My exemplary work was orally acknowledged by the enterprise.” Even the job
of the housewife was affected by similar changes: “As a housewife I started
to cook the specialties”; “I keep the household a little tidier”; “Here I
can do the home duties better because we have a better dwelling.”

4. Urban atmosphere. Responses were not always phrased in specific terms;
a number referred to the overall character or atmosphere of Skopje: “The
atmosphere itself is different, from a village into a town”; “Now I live in
a bigger and much livelier town”; “The way of living is more positive than
the previous one”; “There are more possibilities for everything because the
town is bigger”; “The thought that I am a citizen of Skopje has made me feel
prouder”; “There is another atmosphere here, other customs”; “I expected and
found vividness in the cultural and community life.”

5. Negative changes. A very few individuals indicated a type of change
which was undesirable: “I am dissatisfied with my job, with myself”; “As far
as young people are concerned, I do not know, but [...] entertainment life
is very poor”; “I work very hard and I have limited spare time”; “There is
disorder here.”

In 1938 Louis Wirth wrote, with respect to urban populations in the United
States:

The personal traits, the occupations, the cultural life, and the ideas of
the members of an urban community may ... be expected to range between
more widely separated poles than those of rural inhabitants.³

On the basis of the changes indicated by the respondents in this [...]
community of Skopje in 1963 seemed to illustrate the same [...] referred to
by Wirth. The special characteristics of the urbanized community can be
brought into focus quite sharply when seen through the eyes of newcomers
from rural areas or small towns. The urban way of life, as [...] by the
migrants, represents far more than an improvement in income, [...] general
living standards, opportunities for employment, and formal education for
children; it represents a whole new world of people, ideas, activities, and
culture. It provides far more challenging occupational opportunities and a
type of atmosphere which stimulates the individual to grow intellectually
and to learn about a new way of life. It generally provides more leisure so
that the individual can take advantage of these opportunities.


Some Functions of Qualitative Analyses 

One function of such an analysis is that it makes the investigator feel so
close to the phenomena under investigation that he has little difficulty in
formulating hypotheses and theories about the processes involved. This is
akin to what is accomplished by a good novelist or playwright: his
characterizations come alive for his audience, and his audience feels as if
it understands the behavior of the fictional individuals. For the scientist,
such a feeling for the subject constitutes an important source of ideas
within the context of discovery. Whether or not these ideas subsequently
bear fruit cannot be assured, of course, for hypotheses must still survive
examination in the context of justification.

The analysis just described was instrumental in helping to develop several
ideas about the urbanization process. One of them is the concept of
heterogeneous personality and of homogeneous personality. The heterogeneous
personality is conceived of as tending to have a highly differentiated set
of goals or values and to see many different alternative courses of action
in a given situation. As a result, it tends to exhibit a wide range of
behavior in

³ Louis Wirth, “Urbanism as a Way of Life,” The American Journal of
Sociology, XLIV: 11 (1938).


of such exposure. Factors that seemed to [...] to a variety of experiences
[...] Skopje, the size of the community of previous residence, [...] of
previous moves, the degree of education, the kind of occupation, and income.
Each of these hypotheses received [...] shown to exist between the various
indices of degree of [...] wide variety of experiences and the degree of
heterogeneity of personality.

Some American studies of urbanization imply [...] differentiation is
indicative of schizoid tendencies and [...] reading of the responses of
migrants to Skopje seems to provide [...]. As a matter of fact, the
responses to the open-ended question [...] indicate that the individuals who
are experiencing personality [...] are quite pleased with the process and
seem to be well adjusted [...]. As a result, it was hypothesized that a high
degree of personality heterogeneity is related to various indices of
satisfactory adjustment. This hypothesis received confirmation when the
relevant data [...].

It seems, then, that the descriptive analysis of open-ended questions can
serve the following functions: (1) provide a “feel” [...] investigation,
which might prove to be suggestive in the context of discovery; (2) suggest
the formulation of particular concepts or [...]; (3) suggest the formulation
of particular hypotheses. Often such [...] data suggest not only the general
outlines of concepts and [...] but also very specific ideas for measurement.
For example, the measurement of degree of personality heterogeneity involved
differentiation of goals or values. One implication of this definition is
that the number of goals or values important to a given individual might
provide an index of degree of heterogeneity. This index could be utilized,
for respondents had been presented with a series of 21 fixed-alternative
questions concerning the importance (to them) of different possible goals.
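
One way such an index might be computed is sketched below. The text does not
report the response format of the 21 questions, so the 1-to-4 importance
scale, the cutoff of 3, the function name `heterogeneity_index`, and the
ratings themselves are all assumptions made for illustration.

```python
def heterogeneity_index(ratings, important_at=3):
    """Count the goals a respondent rates at or above `important_at`.

    `ratings` holds one importance rating per goal; the 1-4 scale and the
    cutoff are assumptions, not details taken from the study.
    """
    if len(ratings) != 21:
        raise ValueError("expected one rating for each of the 21 goals")
    return sum(1 for r in ratings if r >= important_at)


# Hypothetical respondent: 21 ratings on the assumed 1-4 scale.
ratings = [4, 3, 1, 2, 4, 4, 1, 1, 3, 2, 2, 4, 1, 3, 3, 1, 2, 4, 1, 1, 2]
index = heterogeneity_index(ratings)
```

A respondent who rates many goals as important receives a high score, which
corresponds to the notion of a highly differentiated set of goals or values.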


13.3 TWO-VARIABLE ANALYSIS OF "REASONS" 

An Illustration: Reason Analysis 

Reason analysis (see Section 6.4) is a series of procedures for analyzing
the range of possible “reasons” that might be given in response to a
particular Why? question. In the present context, reason analysis may be
used to illustrate two things: a two-variable analysis technique (in the
tradition of John Stuart Mill’s method of difference), and a relatively
structured approach to the analysis of open-ended questions.

The example used will be an interview survey of 192 women in Cambridge,
Massachusetts, which focused on their reactions to illness. Some of the
questions asked set up hypothetical choice situations. One of them is the
following:



11. Your father, who has arthritis, now lives alone at the age of
    seventy-three. He has a tendency to neglect his diet, mainly because he
    can’t get around to shop easily and cooking is difficult for him. Also,
    his home is quite a mess. One person has suggested that someone be
    obtained who could shop, cook and clean for him, while another feels
    that he might be better off in a home for the aged. Which do you think
    is best?

        Someone to shop, cook and clean (homemaker).

        Home for the aged.

11a. What are the reasons for your choice?

Unlike the Skopje study, this analysis was preceded by the formulation of
a definite set of hypotheses. As a result, the answers to the open-ended
question could be used as preliminary evidence for or against the
hypotheses. The three hypotheses involved were the following:

1. Choices (decisions) of human beings result in part from their values or
goals.

2. Choices (decisions) of human beings result in part from their
expectations as to the values that might be involved in each of the options
under consideration.

3. Choices (decisions) of human beings result in part from their
perceptions of social pressures for the various options.

These hypotheses will be recognized as being similar to those involved in
the medical student study.

The data in response to this open-ended question are presented in Table
13-1, which is the product of procedures for categorizing the answers.

Table 13-1 shows the relationship among responses pertaining to goals,
value expectations, and social pressures on the open-ended question (11a),
and the choice made in the fixed-alternative question (11). For example,
twenty-four individuals who chose a homemaker in Question 11 also mentioned
the goal of individuality in Question 11a. On the other hand, only one
individual who chose a home for the aged in Question 11 also mentioned the
goal of individuality. On the basis of these data, we may conclude that
there is a relationship between two variables (the goal of individuality and
the choice of a homemaker for the aged).

Another relationship which appears to exist within the table has to do with
the value expectation for good personal adjustment in the father’s own home.
A total of forty-nine individuals who chose a homemaker also mentioned this
value expectation, while only one individual who chose a home for the aged
mentioned it. Other variables which appear to be related to the variable of
choice of homemaker or a home are (1) a value expectation of safety for the
patient, (2) a value expectation that homes for the aged are unsatisfactory,
(3) a perceived social pressure from the father, and (4) a perceived social
pressure about the responsibility of the family.

TABLE 13-1
ANALYSIS FOR RESPONDENTS CHOOSING “HOMEMAKER” AND “HOME FOR AGED”

[Table 13-1: columns are Examples, Homemaker, and Home for Aged; rows are
grouped under Goals, Value Expectations, and Social Pressures.]

In only one of these relationships are there more individuals choosing a
home for the aged who mention the item in question. Only eight individuals
who chose homemaker mentioned safety for the patient, as compared with
twenty-five who chose a home for the aged. Thus, we speak of the direction
of this relationship as being opposite to that of the others, for the other
items are related to the choice of a homemaker while this one is related to
the choice of a home for the aged.

There exist statistical procedures which the investigator can utilize to
make his decisions as to which items are related to the choice of homemaker
or a home and which are not (see Chapters 14 and 15). One advantage of such
procedures is that they make explicit the criteria for such decisions. In
this way, other investigators are free to examine these criteria as well as
the ultimate decision. In the analysis described, by contrast, the explicit
criteria used to make these decisions have not been presented. Consequently
it is more difficult to obtain agreement that the interpretation made by the
investigator is a wise one. The reader may suspect, for example, that the
investigator had a particular axe to grind. Once the criteria are outlined,
however, the decisions of the investigator are placed on a more objective
basis, and it is consequently easier to arrive at consensus within the
scientific community.


Reason Analysis and Mill's Method of Difference 

The analysis illustrates Mill’s method of difference. We start with an
initial difference among individuals with respect to whether their choice in
Question 11 was a homemaker or a home for the aged. Now we examine the data
to see whether any differences in the items mentioned in Question 11a occur.
Any difference that occurs between the homemaker and home-for-the-aged
groupings will be regarded as partial evidence for the existence of a
cause-and-effect relationship between the two variables involved.

We may infer, on the basis of the data in Table 13-1, that there is some
evidence for a causal relationship between choice of homemaker or home and
the goal of individuality. This is because 24 of the 140 respondents
choosing “homemaker” (or 17 percent) also mentioned the goal of
individuality, whereas only 1 of the 41 respondents choosing “home for the
aged” (2 percent) referred to this goal. More generally, we may infer that
there is evidence for the validity of the hypothesis: choices of human
beings result in part from their values or goals. We may also infer that
there is some evidence for the hypotheses concerning value expectations and
social pressures.
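
The percentages in this comparison follow directly from the counts given
above, as the short sketch below shows. The helper name `percent` is
hypothetical; the counts (24 of 140 and 1 of 41) are those reported in the
text.

```python
def percent(mentions, group_size):
    """Share of a choice group mentioning an item, as a whole percentage."""
    return round(100 * mentions / group_size)


# Goal of individuality, by choice made in Question 11.
homemaker = percent(24, 140)      # 24 of 140 respondents
home_for_aged = percent(1, 41)    # 1 of 41 respondents

# The method of difference looks at the gap between the two groups.
difference = homemaker - home_for_aged
```

Expressing both groups as percentages is what makes them comparable, since
the two groups differ greatly in size.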



It is obvious to the present-day scientist that [...]. For example, there is
a close relationship between the salaries of Presbyterian ministers in
Massachusetts and the price of rum in Havana, but it would be ridiculous to
conclude that a cause-and-effect relationship therefore exists between the
two. [...] that the three hypotheses tested, each of which states a
cause-and-effect relationship, are true. It should be noted that the
inference drawn from the data is that there is “some evidence” for the
hypotheses, which is not the same as stating that the hypotheses are true.


Reason Analysis and Mill’s Method of Agreement

Mill’s method of agreement may also be illustrated with data from Table
13-1. To do this, let us examine the column under “Homemaker.” The numbers
in this column may be viewed as indicating the degree of “agreement” among
individuals between the choice of a homemaker and the particular item under
consideration. For example, 24 individuals who chose a homemaker also
mentioned the goal of individuality, [...] personal adjustment would be
better in his own home, [...] homes for the aged are generally
unsatisfactory, and so on. The degree of agreement indicated by each of
these items provides limited evidence for [...] cause-and-effect
relationship. The same is true for the column under “Home for the Aged,”
where the item on safety for the patient is frequently mentioned by those
individuals who made this choice. One might say, for example, that there is
some evidence to the effect that an expectation that the personal safety of
an aged individual will be very great in a home constitutes one reason for
the choice of a home for the aged.

Of course, such an inference is much more soundly based when the method of
difference is brought to bear on the data. Thus, the inference is more
plausible when we can say that 25 of the 42 individuals (60 percent) who
chose “home for the aged” also mentioned safety, while only 8 of the 140
individuals (6 percent) among those who chose “homemaker” also mentioned
safety. Had these percentages been identical, a concern with safety would
have been no more related to the choice of a home for the aged than to the
choice of a homemaker. Using the method of agreement it is not possible to
detect such an equivalence, since this method focuses on either those
individuals choosing “homemaker” or those choosing “home for the aged,” and
not on a comparison between these groups. Thus, the method of agreement
would have implied a conclusion contradicted by the method of difference.
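
The contrast between the two methods can be made concrete with a small
sketch. The function names are hypothetical. The counts for the safety item
are those cited in the text, with the homemaker group taken to be the 140
respondents reported earlier; the “counterfactual” figure of 84 is invented
solely to show the case of identical percentages.

```python
def agreement(mentions, group_size):
    """Method of agreement: share of ONE group mentioning the item (percent)."""
    return round(100 * mentions / group_size)


def difference(m1, n1, m2, n2):
    """Method of difference: gap between the shares of two groups."""
    return agreement(m1, n1) - agreement(m2, n2)


# Safety for the patient: home-for-the-aged group vs. homemaker group.
aged_only = agreement(25, 42)                 # looks at one column only
observed = difference(25, 42, 8, 140)         # compares the two columns

# Hypothetical case: the homemaker group mentions safety just as often.
counterfactual = difference(25, 42, 84, 140)
```

In the hypothetical case the one-column figure for the home-for-the-aged
group is unchanged, yet the comparison between groups shows no relationship
at all, which is the equivalence the method of agreement cannot detect.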


to tell the truth. For example, respondents may attempt to give a reason
that is perfectly consistent with their choice in Question 11, even though
it does not represent their true feelings.

This bit of information comes from Darrell Huff, How to Lie with Statistics
(New York: W. W. Norton and Co., 1954).



his willingness to convey them. Of course, [...] that the approach that
depends on the [...] of the respondent is in any sense inferior to one that
does not. As a matter of fact, it is very often useful to supplement one
type of analysis with the other. The non-introspective approach involves
simply the cross-tabulation or [...] relationship of any given variable with
any other variable in order [...] understanding of a given process. These
variables may [...] arrive at the action meaning (that is, the respondent’s
own [...] of the processes involved) of the respondent without directly
ascertaining [...] meaning.

The two-variable analysis of data which do not [...] meanings of respondents
may be illustrated by an [...] introductory sociology class. One of the
requirements of the [...] student analyze questionnaire data and submit a
[...] analysis. The assignment involved a considerable amount [...],
including the use of IBM data-processing equipment and the [...] statistical
tests. At an early stage in the assignment, students were asked whether or
not they were interested in it. Results are shown in Table 13-2.


TABLE 13-2
DEGREE OF INTEREST IN A CLASS ASSIGNMENT

    Interested    Not Interested    Total
       (%)             (%)           (%)
        44              56        100 (N = 43)


Some Conventions for Tabular Presentation: One Variable

Several facts about single-variable data should be emphasized. First, it
should be noted that Table 13-2 includes a reference both to the variable
involved (“degree of interest”) and to its different categories (“interested” and
“not interested”). In addition, the table indicates the percentage of students
who were interested, and not interested. Percentage analysis is utilized
extensively in the behavioral sciences because of the clarity with which it
conveys information and the ease with which it allows comparisons to be made.
This will become apparent in Tables 13-3 and 13-4, which present data on two
variables. One additional fact about Table 13-2 is that the total number (or
frequency) of students in the class is listed in parentheses. This is done because
percentages can be quite misleading when we do not know what frequencies
are involved. For example, the statement that 67 percent of students are
interested and 33 percent are not may mean that two students are interested and
one is not, or it may mean that 200 students are interested and 100 students



Sec. 13.4] Two-Variable Analysis of Structured Data

are not. When the total number of students is included in the table, a reader
may calculate the frequency of students who are interested and those who are
not by multiplying the decimal equivalent of each percentage by the total
number of students:

    Interested        0.44 × 43 = 19
    Not interested    0.56 × 43 = 24
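
The calculation just described can be sketched in a few lines of Python. This is only an illustration: the function name and the dictionary layout are assumptions introduced here, while the percentages and the class size of 43 come from Table 13-2.

```python
def frequencies_from_percentages(percentages, total):
    """Recover approximate frequencies from whole-number percentages.

    Each percentage is converted to its decimal equivalent, multiplied by
    the total number of cases, and rounded to the nearest whole case.
    """
    return {label: round(pct / 100 * total) for label, pct in percentages.items()}


# Percentages and total as reported in Table 13-2.
table_13_2 = {"Interested": 44, "Not interested": 56}
counts = frequencies_from_percentages(table_13_2, total=43)
print(counts)
```

Because the percentages were themselves rounded, the recovered frequencies are estimates; here they happen to sum to the class total.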


The question might be raised as to why the frequency was not presented
along with the percentage for each category in Table 13-2. Actually, this might
well have been done. There are no standard rules for the construction of tables
other than one widely accepted general goal: clear and unambiguous
communication. Because only one variable is involved in Table 13-2, it would not
be particularly confusing to include frequencies along with the percentages.
When two or more variables are included in a given table, however, an
inclusion of all of the frequencies as well as the percentages tends to make it
somewhat more difficult to obtain a clear picture of the data. Clarity is especially
important if we take into account the fact that the scientific literature in any
given field is voluminous, and that the amount of time which may be devoted
to a given research report is necessarily quite limited.

One final point about Table 13-2 has to do with something which is omitted
rather than included: the way in which the variable “degree of interest” fits
into the researcher's theory. In particular, does the investigator have in mind
any concept represented by this variable which constitutes an element of
theory and which is perhaps more abstract than the variable under
consideration? In the analysis of data, it is all too tempting to stay at the concrete and
specific level of the wording of the question (in a survey) and neglect the
development of more abstract concepts, but the scope of the study is thereby
reduced. In Table 13-2, “degree of interest” is conceived of as getting at a
particular goal or value, namely, the performance of the class assignment.
The substitution of the word goal for interest does not by itself increase the
scope of the study, but the concept of goal or value is already conceived of
by the investigator as an important element of propositions and theory.

Some Conventions for Tabular Presentation: Two Variables

After the introductory students completed their assignment, their
reports were graded. The investigator at that point focused his attention on an
attempt to discover the factors that affected the grades on these reports. One
hypothesis tested was that the degree of interest in the paper affected the grade
obtained. (The data that test this hypothesis are presented in Table 13-3.)
Although Table 13-3 may look simple enough, several of the conventions
involved require explanation. First, the general format is quite similar to that
of Table 13-2. Both specify the variable(s) involved as well as the particular
categories of each. Also, both tables deal with percentages and present the




frequencies in parentheses. Table 13-3, because it is composed of four major
cells or entries, would look a great deal more cluttered if the frequency
associated with each percentage were included. Unlike Table 13-2, in which the
percentages can be computed in only one direction, Table 13-3 presents a
choice of direction. We may designate degree of interest as the independent or
causal variable and grade as the dependent or effect variable in this table. Then
the totals used to calculate the percentages may represent summations of the
frequencies in each of the categories of the independent variable or the
dependent variable. If they are calculated for the independent variable, as they
are in Table 13-3, we say that the percentages are computed “in the direction
of the independent variable.”

TABLE 13-3
GRADES OBTAINED BY STUDENTS WITH DIFFERENT
DEGREES OF INTEREST IN A CLASS ASSIGNMENT

                         Degree of Interest
    Grade            Interested    Not Interested
                        (%)             (%)
    B- or higher         58              25
    C+ or lower          42              75
    Total             100 (19)        100 (24)

Table 13-4 is identical to Table 13-3 except that its percentages are
computed in the direction of the dependent variable; i.e., the totals on which the
percentages are based are summations of the frequencies in each of the
categories of the dependent variable.


TABLE 13-4
GRADES OBTAINED BY STUDENTS WITH DIFFERENT
DEGREES OF INTEREST IN A CLASS ASSIGNMENT

                         Degree of Interest
    Grade            Interested    Not Interested      Total
                        (%)             (%)             (%)
    B- or higher         65              35           100 (17)
    C+ or lower          31              69           100 (26)




It is generally easier to communicate information when percentages are
computed in the direction of the independent variable (as in Table 13-3).
This can be understood when we attempt to state the results derived from
each of the two tables. Table 13-3 indicates that 58 percent of those students
who were interested in the class assignment received a grade of B- or higher,
while only 25 percent of those students who were not interested received a
grade of B- or higher. Thus, the percentage of high grades is substantially higher
among those students showing interest in the assignment than among students
who did not. It would of course constitute an equivalent statement to say that
the percentage of low grades is substantially lower among those students
showing interest in the assignment than among those students who did not.

Table 13-4 indicates that the percentage of students showing interest in
the assignment is substantially higher among those students who received high
grades than among those who received low grades. Thus, the reasoning is from
the dependent variable or effect (those receiving high grades and those
receiving low grades) to the independent variable or cause. By contrast, Table 13-3
compares percentages between the two categories of the independent variable
(degree of interest) and reasons from the independent variable to the
dependent variable. Because it is easier to reason from cause to effect rather than from
effect to cause, it is conventional to compute percentages in the direction of
the independent variable, as in Table 13-3.
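
The two directions of percentaging can be compared in a short Python sketch. The cell frequencies used below are not printed in the tables; they are an assumption, reconstructed here from the published percentages and the totals in parentheses (19 and 24 by interest, 17 and 26 by grade). The function name is likewise hypothetical.

```python
# Cell frequencies: an assumption reconstructed from the percentages and
# totals of Tables 13-3 and 13-4, not figures printed in the text.
counts = {
    ("Interested", "B- or higher"): 11,
    ("Interested", "C+ or lower"): 8,
    ("Not interested", "B- or higher"): 6,
    ("Not interested", "C+ or lower"): 18,
}


def percentage_table(cell_counts, direction):
    """Percentage each cell on the totals of one variable.

    direction=0 uses the totals of the independent variable (interest);
    direction=1 uses the totals of the dependent variable (grade).
    """
    totals = {}
    for key, n in cell_counts.items():
        totals[key[direction]] = totals.get(key[direction], 0) + n
    return {key: round(100 * n / totals[key[direction]])
            for key, n in cell_counts.items()}


by_interest = percentage_table(counts, direction=0)  # layout of Table 13-3
by_grade = percentage_table(counts, direction=1)     # layout of Table 13-4
print(by_interest)
print(by_grade)
```

The same four frequencies yield both tables; only the totals used as the base of each percentage differ.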

Another consideration is the degree of accuracy to which percentages should
be computed. In Tables 13-3 and 13-4, they were computed to the nearest
whole percent. This degree of accuracy should suffice for most purposes. The
great advantage of not computing percentages to the nearest tenth or
one-hundredth of one percent is the clarity achieved. The situation would be
different, however, if, for example, this rounding procedure obliterated very slight
differences between individuals in the two categories of the independent
variable. Let us suppose, for example, that 10,000 students were interested
in the assignment and 10,000 more were not. Let us also assume that 42.4
percent of the interested students obtained high grades while 41.6 percent
of the uninterested students obtained high grades. This difference may be
statistically significant, but it would be lost if both figures were rounded to
the nearest whole percentage point. Of course, such a situation happens rarely.
A more usual reason for calculating percentages to the nearest tenth of one
percent is to conduct more exact statistical tests. Even in such cases, however,
it is not essential to present figures that are accurate to the nearest tenth of
1 percent in the research report.
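
The rounding problem in this hypothetical case can be illustrated with a brief Python sketch. The choice of a pooled two-proportion z statistic is an assumption made here for illustration; the text does not say which statistical test is intended, and whether the difference counts as significant depends on the test and significance level chosen.

```python
import math

# Figures from the hypothetical example: 10,000 students in each group.
n_interested = n_uninterested = 10_000
p_interested, p_uninterested = 0.424, 0.416

# Rounded to the nearest whole percent, the two figures become identical.
rounded = (round(p_interested * 100), round(p_uninterested * 100))

# Pooled two-proportion z statistic (an assumed test, for illustration only).
pooled = ((p_interested * n_interested + p_uninterested * n_uninterested)
          / (n_interested + n_uninterested))
standard_error = math.sqrt(pooled * (1 - pooled)
                           * (1 / n_interested + 1 / n_uninterested))
z = (p_interested - p_uninterested) / standard_error

print(rounded, round(z, 2))
```

Once both percentages are rounded, the difference on which any such test would operate is no longer visible in the table.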

If we grant that a relationship has been established between degree of
interest in the class assignment and grade received, what general implications
does such a finding have? Here we must look to those concepts and propositions
that represent a higher level of abstraction. Degree of interest in the class
assignment may be viewed as an indicator of the value of the class assignment
to the individual, while the grade on the report may be viewed as resulting from
the large number of choices or decisions the student makes in the preparation
of his report. Thus, one proposition representing a higher level of abstraction
than that which relates interest to grade is the one previously discussed in
the context of the study of reactions in Cambridge, Massachusetts: Choices
(decisions) of human beings result in part from their values or goals. At this
level of abstraction, the data constitute partial evidence for a proposition of
much wider scope.

Additional Independent Variables

Thus far in the search for factors which affected the grades, only
one factor has been considered: interest in the assignment. Another factor
that was tested had to do with the seating arrangement of the students. It was
noted by the instructor that about half the students tended to sit in the
first three rows of the large lecture room while most of the remaining students
were packed in the rear of the room; few students occupied the middle rows.
This pattern suggested that the seating arrangement might be an indicator
of the degree to which the student valued the class lectures and discussions.
Table 13-5 contains the data used to test the hypothesis that seating
arrangement affected grades on the research reports.


TABLE 13-5
GRADES OBTAINED BY STUDENTS WITH DIFFERENT
SEATING POSITIONS

                         Seating Position
    Grade            Rows 1-3      Rows 4-15
                        (%)           (%)
    B- or higher
    C+ or lower
    Total


The relationship emerging from these results seems to be about as substantial
as the one between interest in the assignment and grade. If we considered
these data in relation to a proposition at a higher level of abstraction, we
might say that partial evidence is also provided here for the proposition that
choices of human beings result in part from their values or goals. It should be
noted that Table 13-5 utilizes data collected by means of observation as well
as through paper-and-pencil tests. Although most of the examples of analytic
procedures presented here are based on questions put to respondents,
procedures for analyzing other types of data are generally no different from these.




When these results were presented to the students in the class, they seemed
generally to be quite surprised. Few of them seem to have suspected that the
factors of interest in the class assignment and seating position would be closely
related to grade on the assignment. This points up the importance to the
investigator of cross-tabulations that are not based solely on the respondent's
definition of the situation. This is not to deny that the respondent's definition
of the situation may be quite suggestive in its own right. In this case, the
students were asked what other factors they thought might have affected the
grades; some of the answers are analyzed in Tables 13-6 and 13-7.


TABLE 13-6
GRADES OBTAINED BY STUDENTS WITH DIFFERENT
SOCIAL SCIENCE BACKGROUNDS

                     One or More          No Previous
                     Courses in           Courses in
                     Sociology,           Sociology,
                     Psychology,          Psychology,
    Grade            or Anthropology      or Anthropology
                         (%)                  (%)
    B- or higher          41                   38
    C+ or lower           59                   62
    Total              100 (22)             100 (21)


TABLE 13-7
GRADES OBTAINED BY STUDENTS FROM DIFFERENT
COLLEGE CLASSES

                     Junior or        Freshman or
                     Senior           Sophomore
                        (%)              (%)

The data in Table 13-6 do not indicate a relationship between social
science background and grade. Although it might be argued that 41 percent of
students among those with the greatest social science background received
high grades as compared with only 38 percent of those with the least
background, such a percentage difference is negligible when the frequencies
are taken into account. As a matter of fact, of the 22 individuals with the
greater background, 9 obtained a B- or higher, while 8 of the
21 individuals with the lesser background obtained a B- or higher. If the
frequencies were very much higher, however, the difference between 41 percent
and 38 percent might have some significance.

Table 13-7 indicates a definite relationship between college class and grade
received. We may infer that the insights of professional social scientists are
not the only ones effective in explaining human behavior; the definition of
the situation by those individuals whose behavior is to be explained
constitutes another important source of ideas. Once this relationship between
college class and grade on the class assignment is shown, however, it remains
to seek an explanation on a more abstract and theoretical level. Table 13-6
indicates that it is probably not the additional courses upper classmen have
taken that give them an advantage over lower classmen. One possible
explanation might have to do with study habits; e.g., upper classmen may have
learned to receive or perceive more accurately than lower classmen the
expectations sent by the instructor as to what is required. Such an explanation
would have to be checked by the collection of additional data. For example,
the instructor might give the class instructions that are then to be summarized
by the students, and the different classes might then be compared as to the
accuracy of their interpretations. Other interpretations of the data in Table
13-7 are also possible.

From the tables presented, it can perhaps be seen that research is a
cumulative process. In the search for factors affecting grades on the class
assignment, many different hypotheses are possible. For any hypothesis that is
verified, it remains to obtain a theoretical interpretation. At that point,
additional data most probably will be needed to check the alternative
interpretations.


SELECTED REFERENCES FOR FURTHER STUDY

Fred N. Kerlinger, Foundations of Behavioral Research (New York: Holt,
Rinehart and Winston, 1964). The wide range of topics includes the language
and approach of science; sets, relations, and variance; probability and
inference; designs of research; types of research; measurement; methods of
observation and data collection; and analysis and interpretation. There is particular
concern with applications in psychology and education, but most of the book
is of general relevance to the social scientist.

Paul F. Lazarsfeld and Morris Rosenberg (eds.), The Language of Social
Research (New York: The Free Press of Glencoe, 1955). This reader consists
of 64 articles on a wide variety of empirical studies. The major parts of the
book deal with concepts and indices, multivariate analysis, the analysis of
change through time, formal aspects of research on human groups, the empirical
analysis of action, and general problems in the philosophy of social science.

Hans Zeisel, Say It With Figures (New York: Harper and Row, 1957). Detailed
discussions of the very specific procedures utilized by the survey analyst are
difficult to find. Such discussions are presented here, and some of the topics
included are the functions of percentages, how to handle the don't know's and
no answer's, reason analysis, true and spurious explanations, and the panel study.



Principles of Statistical 
Decision Making 


14.1 THE NATURE OF STATISTICS 

Functions of Statistics

A fairly recent conception of statistics is “a body of methods for making
wise decisions in the face of uncertainty.”¹ This definition encompasses other
widely held conceptions of statistics. For example, statistics is often thought
of as a collection of facts stated in numerical terms. This is partly implied by
the first definition, for it is desirable to obtain whatever facts are relevant if
one is to come up with wise decisions. Moreover, the procedures of statistics
have been found to be most useful when attempts have been made to apply
them to phenomena under study and use them as a basis for decisions.

One important distinction between a conception of statistics as a collection
of facts and as a body of methods for making wise decisions is that the former
makes no reference to any human agent whereas the latter indirectly brings
in the decision maker by referring to decisions. But who are the decision
makers to whom reference is made? What kinds of decisions are they interested
in making? And by what criteria may we judge whether or not a decision is
wise?

According to one point of view, “the purposes for which statistical data are
collected can be grouped into two broad categories, which may be described
loosely as practical action and scientific knowledge.”² Practical actions
may include anything from the decision by a university administrator on
whether or not to raise tuition to the decision by a secretary on whether or not
to carry her umbrella. Scientists, however, are concerned with decisions that
lead to the furthering of knowledge. The scientist, as well as the
university administrator and the secretary, probably evaluates the wisdom of a
¹ W. Allen Wallis and Harry V. Roberts, Statistics: A New Approach (New York:
The Free Press of Glencoe, 1956), p. 5.
² Ibid., p. 4.



given decision on the basis of the effect of the decision on his goals. If, for
example, the secretary decides to carry her umbrella and it rains, then
her goal of avoiding being drenched is achieved and she will regard her decision
as an intelligent one. A scientist may decide, on the basis of available evidence,
that a particular hypothesis is valid and is therefore worth exploring. If this
decision leads to important discoveries he too will probably decide that it
was a wise one.

The decision-making procedures of the layman and the scientist may be
quite similar. For example, the university administrator, in making his
decision about a tuition increase, probably takes into account a number of factors
or goals. For example, additional funds may be needed to make
faculty salaries more competitive, to pay for increased costs, or
to finance the expansion of the physical plant. On the other hand, he may be
concerned about the possible loss of high-quality applicants to the university.
His action may constitute an attempt to make a decision that
maximizes gains and minimizes losses. But how does the administrator know,
for example, what the actual effect of a tuition raise will be on high-quality
applicants to the university? The chances are that he knows very little about
this, but he must make his decision even in the face of uncertainty. The wise
administrator gathers whatever facts on the situation he can and then makes
his decision in such a way as to maximize the probability of achieving his
goals.

The scientist too has numerous decisions to make, such as the choice as
to which of several theories to attempt to develop and test further. The
selection of the most promising theory may lead to a substantial increase in
scientific knowledge, whereas the selection of a sterile or invalid theory may
contribute little. The scientist, like the administrator, must make his decision
in the face of uncertainty. Although data on each theory are available, results
are never certain in any science. In the behavioral sciences in particular,
results thus far have tended to be quite limited and tentative. The wise scientist
carefully examines whatever data are available and then makes his decision in
such a way as to maximize the probability of achieving his goals.

Statistical procedures may aid in the decision-making process of both the
administrator and the scientist. They may aid in the collection of additional
data by specifying, for example, probability sampling procedures. They may
aid in the analysis of the data by providing quantitative rules for inferring the
nature of reality from these data. They may aid in the decision-making process
by replacing an intuitive method with a highly objective and quantified process
that makes maximum use of the data available.


Statistics and Scientific Communication

If statistics is conceived of as “a body of methods for making wise decisions
in the face of uncertainty,” how is it to be distinguished from the research


procedures discussed in earlier chapters? Is it not true that all research strategy
attempts to accomplish the same end? The key distinction between statistics
and other research methods is that statistics provides a quantified approach
to research. Statistical procedures rely heavily on mathematics and, in
particular, on the concept of probability and the rules for combining probabilities
that have been developed by mathematicians.

Somewhat more insight into the nature of statistics may be gained if we
examine what this approach achieves in the context of justification. Statistics,
like any other research strategy, is not a disembodied set of rules which is
“right” for a certain class of situations; rather, it is a set of procedures utilized
by scientists in their search for the nature of reality. As such, statistics may
well be evaluated with respect to its impact on the communication process
that binds together the scientific community.

Statistical procedures may serve to make quite clear the basis on
which the scientist collects, analyzes, and interprets his data. Knowing this,
other scientists are in a position to understand more thoroughly what has been
done and to make use of this information in furthering their own efforts. It
may be decided, for example, to repeat or replicate the research conducted
by another scientist, or to introduce certain modifications in theory or method
in order to examine the impact of these changes. Other scientists may not be
satisfied with the analysis and interpretation of data collected by a given
scientist. If that scientist utilized statistical procedures, however, they are in
a position to better understand the basis for his conclusions.

Statistical procedures provide more than the possibility of effective
communication within the scientific community. Scientists largely agree on what
a lawyer would call rules of evidence. That is to say, there is considerable
agreement on what kinds of data are admissible as evidence and on what
conventions may be followed for assessing the implications, for a particular
hypothesis, of a given set of data. It is in this area that statistical
procedures play an important role. The conventions or rules for assessing the
implications of such data are often based on explicit statistical procedures
which predict the probability that the scientist is correct or is in error in his
decision on whether to accept or reject a given hypothesis. Statistical
procedures also play a role in the collection of data which is admissible as
evidence. The role of statistics in probability sampling procedures has already
been referred to. Another example is that of randomization within the
context of the experiment. These rules or conventions for collecting and assessing
evidence also aid in the communication process among scientists. For
example, even if a given scientist does not subscribe to these
conventions, at least it will be perfectly clear to him what procedures have
been utilized, and he is thus in a position to make maximum use of whatever
has been accomplished.




The Context of Discovery 


There are at least four ways in which statistics may aid the scientific
community in the context of discovery. One is indirectly, through its role in
the context of justification. For example, a conclusion, made on the basis of
statistical procedures, that there is substantial evidence for the validity of a
given hypothesis serves to point the scientist in a certain direction. Through
the test of a given hypothesis, the scientist learns that further work in the
development and testing of that hypothesis and related hypotheses might be
fruitful. On the other hand, if the evidence obtained does not support the
hypothesis, the scientist will be led to alter it or look to other hypotheses.


A second way in which statistics may be of aid in the context of discovery
is more of a program than a reality. At present, although a set of rules in the
context of justification may be widely accepted, no corresponding set of rules
has been tested and widely accepted in the context of discovery. Each scientist
selects the methods and theory he believes will be most fruitful. For example,
the process whereby a given scientist is exposed to the results of research
studies tends to be very unorganized. He may or may not read
a particular journal or book, and he may decide to adopt a research procedure
or theory simply because it is familiar. Or he may be impressed by the prestige
of the originator of the method or theory.

These decision procedures are a far cry indeed from statistical decision
procedures. If certain goals for the scientific enterprise are widely accepted,
then it may be possible to utilize statistical decision procedures to aid in the
evaluation of the various methods of discovery. For example, evidence might
be gathered about the degree of utility which each of the various
data-collection methods has had for the explanation of human behavior, and the kinds
of situations in which each seems to have been most effective. On the basis
of such evidence, rules might be developed for selecting some methods
over others in certain situations, and these rules might even be extended to
the selection of hypotheses or theories.

This does not mean that complete consensus is possible among scientists as
to which method or theory is most appropriate for a given research situation.
The degree of agreement will depend on the weight of evidence for a
particular method or theory in a given situation. The determination of this weight
would, in turn, depend on rules for the acceptance of evidence. And again,
although any given scientist may not agree with his colleagues on the rules for
accepting data as evidence or those for weighing it, he will at least be able to
understand more completely the basis on which his colleagues arrived at their
decisions. He may even be encouraged to collect the kind of data which might
indicate to his colleagues that other rules of evidence and decision making
are more effective for moving toward the goals of science.




A third way in which statistics may be of aid in the context of discovery is
also more of a program than a reality. Here we may utilize an analogue from
the context of justification. Within the context of any given statistical test of
hypotheses, the investigator has a number of options. For example, he has to
decide whether his finding of a relationship that actually does not exist is
more disastrous than his not finding a relationship that does exist. Statistical
decision procedures provide a mechanism for numerically weighting these two
results so that the investigator may make his choice. Thus, a degree of
consensus has been achieved on the choice of tactics for deciding exactly how a
given statistical test may be applied. An increase in this degree of consensus
awaits further proof that some tactics are more effective than others in given
situations.
In the context of discovery, the investigator generally does not consider
an entire range of methods or theories from which a choice is made;
rather, he centers on one particular method or theory, and the choice
is among the several options that exist for the collection of data within the
context of this method or theory. For example, in the use of interviewing
procedures there are choices to be made on the proportion of open-ended
and fixed-alternative questions, on the degree to which the interview
is to be standardized, on the use of projective techniques and other methods,
on the way in which data are to be recorded, on the way in which
the social desirability variable is to be handled, and on many other points as
well. At present only very limited data are available on the relative
effectiveness of the different choices, and the data are even more limited on their
relative effectiveness in the context of the special characteristics of a particular
research situation. Thus, the investigator is generally forced to make
intuitive decisions, and the basis for these decisions is rarely communicated
to the scientific community.

It would be possible to rationalize these procedures so as to achieve a
degree of consensus on a strategy for making these choices in a given
research context. This would result in a breakdown of some of the
less-pragmatic bases for making these choices. Thus, for example, the parochial
attachment of a given investigator to the exclusive use of one type of question
in all research situations might give way to a degree of recognition of the value
of a certain proportion of open-ended questions for certain situations.
Naturally, each investigator would still be free to reject the generally accepted
strategy, but at least he would be in a position to understand what
other scientists have done, and he might be challenged to develop
investigations designed to improve on the generally accepted strategy.
It is perhaps superfluous to add that, in order for consensus to be reached,
a good deal of evidence for effectiveness must be
presented. Such evidence, in turn, depends on the development of
theory that can provide thorough explanations and accurate predictions for



Sec. 14.2] The Nature of Probability

From a mathematical point of view, the probability of an event is a number
that lies between 0 and 1, inclusive. More subjectively, the probability of a
tossed coin's coming up heads will be 0 when we are certain that heads will not
occur, .5 when the chance of heads occurring appears to be as great as the
chance for tails, and 1 when we are certain that heads will occur.
Furthermore, if we consider a number of alternative or mutually exclusive events, and
if one among these events must occur, then the sum of the probabilities of the
separate events is equal to 1. This may be illustrated by tossing a coin twice
and considering the alternative events: two heads (HH), one head and one
tail (HT or TH), and two tails (TT). If we assume that the probability of a
heads on a single toss is .5, then the probability of HH is .5 × .5 or
.25. Similarly, the probability of TT will be .5 × .5 or .25. The probability
of HT (heads on the first toss and tails on the second) is .5 × .5 or .25, and the
probability of TH is the same, i.e., .5 × .5, or .25. Thus, the probability of
HT or TH is .50. These probabilities are summarized as follows:

    Probability of 2 heads = p(HH) = .50 × .50 = .25
    Probability of 2 tails = p(TT) = .50 × .50 = .25
    Probability of 1 head and 1 tail = p(HT) + p(TH)
        = (.50 × .50) + (.50 × .50) = .50
    Probability of 2 heads, or 2 tails, or 1 head and 1 tail
        = p(HH) + p(TT) + [p(HT) + p(TH)] = .25 + .25 + .50 = 1.00
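
The figures above can be reproduced by enumerating the four possible outcomes of two tosses. The following Python sketch assumes, as the text does, a probability of .5 for heads on a single toss; the variable names are introduced here for illustration, and exact fractions are used so that the sum comes out to exactly 1.

```python
from fractions import Fraction
from itertools import product

# Assumed single-toss probabilities, as in the text: heads = tails = 1/2.
p_single = {"H": Fraction(1, 2), "T": Fraction(1, 2)}

# Probability of each ordered outcome of two tosses (HH, HT, TH, TT).
outcomes = {first + second: p_single[first] * p_single[second]
            for first, second in product("HT", repeat=2)}

p_two_heads = outcomes["HH"]
p_two_tails = outcomes["TT"]
p_one_of_each = outcomes["HT"] + outcomes["TH"]
total = p_two_heads + p_two_tails + p_one_of_each

print(p_two_heads, p_two_tails, p_one_of_each, total)
```

Since the three events exhaust the possibilities and no two can occur together, their probabilities sum to 1.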


Multiplication and Addition Rules

The above example illustrates several principles for combining
probabilities which are fundamental to the mathematics of probability.
One of them is the multiplication rule, which states that if A and B
are two events, independent of one another, the probability of both A
and B is the product of the probability of Event A and the probability of Event B.
For purposes of illustration, let us define Event A as heads on first toss and
Event B as heads on second toss. These two events are independent of one
another because obtaining heads with the first toss cannot affect whether or
not heads is obtained with the second toss. According to the multiplication
rule, the probability of heads coming up on both the first and second tosses is
equal to the product of the probability of a heads on the first toss and the
probability of a heads on the second toss. Stating this in mathematical terms
we have:

    Probability of 2 heads = p(HH) = p(H) × p(H)

This multiplication rule was used in the above example for calculating
p(HH), p(TT), p(HT), and p(TH).
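
The multiplication rule can also be checked empirically. The sketch below simulates many pairs of independent tosses and compares the observed share of double heads with the value .5 × .5 = .25 given by the rule. The function name, the number of trials, and the fixed random seed are assumptions chosen here for illustration.

```python
import random


def estimate_two_heads(n_trials, seed=0):
    """Estimate p(HH) by simulating pairs of independent fair-coin tosses."""
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    both_heads = 0
    for _ in range(n_trials):
        first_is_heads = rng.random() < 0.5
        second_is_heads = rng.random() < 0.5  # independent of the first toss
        if first_is_heads and second_is_heads:
            both_heads += 1
    return both_heads / n_trials


estimate = estimate_two_heads(100_000)
print(estimate)
```

With a large number of trials the observed proportion should lie close to .25, though it will seldom equal it exactly.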

Another principle illustrated above is the addition rule, which states that if
A, B, C, . . . J are mutually exclusive events, the probability of
getting either A or B or C . . . or J is equal to the sum of



the various types of research situations. Such theory might, for example,
predict what the reactions of individuals of various types will be to
open-ended and fixed-alternative questions on a given topic.

A fourth use of statistics in the context of discovery stems from the fact
that statistical decision procedure is itself a theory which may be
applied to the choices of human beings. Thus far, statistical decision theory
has been used largely in the choices scientists face in the collection and
analysis of data. Thus, the choices scientists should make have been
stressed, rather than the choices they actually do make. It is possible,
however, to use statistical decision theory to focus on the choices human beings
(including scientists) actually do make in a given situation. No
assumption needs to be made that human beings are incredibly rational and always
make optimal choices. Indeed, this was the approach taken in the
medical student study, where rationality was defined in such a way that each
individual was conceived of as acting on the basis of his own particular
expectations and his own set of values. Furthermore, as in the medical student
study, no prior assumption is made that the theory has been proven
in the context of justification.

The question might be raised as to whether or not there is a y r 
ploying statistical procedures on any gven aspect of eocioloast* 

of fact, this question has been quite pointedly asked by se answer li« 
with respect to the testing of hypotheses in survey research. 
in the contributions statistical procedures can make in the con 
cation and discovery. Thus, the question is always a legitimate . gyj, 

question on the utility of a given method or theo^. When the a 
dence indicates that a particular method or theory is less fruitlu 
it seems wise for the scientist to select the approach that appears o . 
elective. At the present time it seems that statistical procedures « 

great value both in the context of justification and in the context o is 


14.2 THE NATURE OF PROBABILITY 

An understanding of the concept of probability as conceived mathematically is essential for learning the nature of statistics. This term is a common one in ordinary usage, and conveys the idea of the likelihood of a given event's occurrence. Thus, a highly probable event is one which will most likely occur, whereas an event with a low probability is not likely to occur. The mathematics of probability may be viewed as providing the basis for this kind of interpretation of probability.

• See, for example, Hanan Selvin, "A Critique of Tests of Significance in Survey Research," American Sociological Review, XXII: 519-27 (1957).

* For a reply to Selvin, see Robert McGinnis, "Randomization and Inference in Sociological Research," American Sociological Review, XXIII: 408-14 (1958).




etry. In order to make use of the mathematics of plane geometry, it is necessary to interpret these terms or coordinate them with entities in the physical world. In addition, the axioms of plane geometry must hold true for the phenomena under investigation. When the interpretation is made and we have evidence that the axioms hold, the multitude of theorems in plane geometry may be effectively applied to predict phenomena. On the other hand, if one or more axioms do not hold, then the theorems may be incorrect. Thus, for example, one of the axioms of plane geometry is that one and only one line can be drawn through a given point so that it is parallel to a given line that does not contain the point. On the surface of a sphere, [however, it is possible to] draw many lines through a given point that are parallel [in the sense that they do not intersect] with the original line. Consequently, this axiom does not hold for the surface of a sphere, although it does hold for the surface of a plane. [If] the same is also true for the other axioms of plane geometry, [we would] expect the theorems of plane geometry to hold true for plane surfaces [...].

[There are analogies between] the nature of probability [and] plane geometry which might help to explain the relationship [between the] mathematics of probability and its applications in the area of decision making. First, the mathematically uninterpreted term probability may be interpreted in several ways. One of them is known as the frequency interpretation of probability. For example, the probability of heads [coming up on a] single toss of a coin is the relative frequency of heads [in an infinite] number of tosses. Thus, if a coin were tossed an infinite number of times and heads came up 50 percent of the time, then we would state that p(H) = .50. On the other hand, if heads came up 41 percent of the time, [we would state] that p(H) = .41. Such an interpretation allows us to [coordinate the] uninterpreted term probability and something that can be observed.

Of course, it is not possible for the scientist or anyone else to toss a coin an infinite number of times, and thus he must be satisfied with [a limited number] of tosses. Consequently, it is not possible for him to know [what the "true"] probability of any event is. This lack of certainty about the nature of reality is, of course, characteristic of scientific investigation and [of the] human situation. The best that the scientist can do is to estimate the "true probability" on the basis of limited data.
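The frequency interpretation can be imitated with a simulated coin. The sketch below is an illustration only: the function name estimate_p_heads, the seed, and the numbers of tosses are assumptions made for the example, and a pseudo-random generator stands in for a physical coin.

```python
import random

def estimate_p_heads(true_p, tosses, seed=0):
    """Estimate p(H) as the relative frequency of heads in a finite run.

    true_p is the (normally unknown) probability of heads; it is supplied
    here only so that the simulated coin has something to follow.
    """
    rng = random.Random(seed)  # seeded so that the run is repeatable
    heads = sum(rng.random() < true_p for _ in range(tosses))
    return heads / tosses

# More tosses usually bring the estimate closer to the simulated value,
# but no finite number of tosses yields the "true probability" itself.
for n in (10, 1_000, 100_000):
    print(n, estimate_p_heads(0.50, n), estimate_p_heads(0.41, n))
```

The estimate from ten tosses can be far from the simulated value, which is the sense in which the scientist must work with an estimate based on limited data.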

We may now view the addition rule and the multiplication rule as constituting axioms of probability theory. We may check [whether they hold] in the coin-tossing situation by using the frequency interpretation of probability. Thus, for example, we might toss each [of two coins] a very large number of times and, on the basis of the results, estimate p(H). Let us assume that it is .50 for each coin. Then we might toss the [two coins] jointly a very large number of times in order to determine whether [p(HH) = .25,] p(TT) = .25, and p(HT or TH) = .50. If such relative frequencies (or rela-




the separate events A, B, C . . . and J. Let us consider the following as three mutually exclusive events:

Event A = Occurrence of 2 heads = (HH)

Event B = Occurrence of 2 tails = (TT)

Event C = Occurrence of 1 head and 1 tail = (HT or TH)

These events are mutually exclusive because, for example, it is not possible for both two heads and two tails to occur in two tosses of a coin. According to the addition rule, p(A or B or C) = p(A) + p(B) + p(C) = .25 + .25 + .50 = 1.00. Thus, like the multiplication rule, the addition rule was used in the above example although it was not explicitly [stated].

The addition rule was also used in another part of the example: the calculation of the probability of Event C (the occurrence of one heads and one tails). Here, there are two ways in which this event may happen: the first toss comes up heads and the second one comes up tails, or the first toss comes up tails and the second comes up heads. We may [define these] occurrences to be Events A and B, respectively. The two events are mutually exclusive. It was necessary in the example to calculate the probability of one heads and one tails, which may be stated as p(HT or TH), or p(A or B). This is a situation in which the addition rule may be applied, [giving the] following:

p(A or B) = p(HT or TH) = p(HT) + p(TH) = .25 + .25 = .50.


Interpretations, Axioms, and Theorems 

Having outlined and illustrated some of the mathematics of probability, it will be useful to explore its relationship to interpretations of probability that are useful to the scientist. The discussion here will be similar to that having to do with the mathematical properties of the various scales of measurement. There, it will be recalled, interpretations were made within the context of [such] illustrations as the play of a basketball team, and it was determined whether the axioms that stated the properties of a given scale were found to hold for the illustration.

A more complete presentation of the interpretation of axiomatic systems involves definitions and theorems in addition to axioms and interpretations. Although it is beyond the scope of this volume to present a rigorous discussion of the mathematics of probability, an informal treatment of the [...]tion and function of a probability theorem will follow. The aim here is to show how the mathematical definition of probability, as well as the addition and multiplication rules, are incorporated in a probability theorem. The theorem itself will be interpreted so that it will be seen to be useful to the scientist in the decision-making process.

Probability is an uninterpreted term within the mathematical field of probability theory just as the terms point and line are uninterpreted in plane geom-


p(2 heads) = 1 × .25 = .25.

This is the same value as that obtained previously, and thus the theorem works in this instance.

The theorem may also be tested on the calculation of the probability of one heads in two tosses. Substituting in the formula, we have:

p(1 heads) = [2!/(1!(2 - 1)!)] × (.50)^1 (.50)^(2-1) = 2 × .25 = .50.

It should be noted that for this calculation there were two ways of getting one heads: heads on the first toss and tails on the second, or tails on the first toss and heads on the second. Because the result is the same as that previously arrived at on the basis of the rules of multiplication and addition, the theorem checks out correctly once more.

Finally, the calculation for the probability of no heads in two tosses is:

p(0 heads) = [2!/(0!(2 - 0)!)] × (.50)^0 (.50)^(2-0) = 1 × .25 = .25.

Here once more the result provided by the theorem is the same as previously [obtained].

This theorem is based directly on the axioms, although [to prove] it would require several other theorems from permutation [and combination] theory. The term N!/[r!(N - r)!] represents the [number of ways of] getting r heads, and this is multiplied by the probability of [one of the ways of getting] r heads. Each way of getting r heads, however, has the same probability as any other way of getting r heads, because differences in [the order of occur]rence of a given number of heads do not change the [probability of the se]quence. Thus, for example, p(HT) = p(TH). [This] would be true even if the probability of a heads were equal to .10: [p(HT)] = (.10)(.90) = .09; also, p(TH) = (.90)(.10) = .09. Thus, when we multiply the probability of one of the ways of getting r heads by the number of such ways, we are in reality doing the equivalent of adding all the separate probabilities of getting r heads, and [this involves] utilization of the axiom for the addition of probabilities. The axiom [for multiplication] is utilized within the term p(H)^r p(T)^(N-r), which [represents the] probability of one of the ways of getting r heads. Here the exponents [provide a] shorthand for the successive multiplication of each of the probabilities [...].

The [above] presentation of some of the rules of probability [...] and a theorem [...] illustrates the way in which the mathematics of probability are related to data about the real world. Thus far, one interpretation [of the] concept of probability has been presented, and two axioms and one [theorem have been] illustrated.





tive frequencies very close to them) are [obtained, we may con]clude that there is evidence for these axioms when the frequency interpretation of probability is made.

A theorem that would be of particular value for the example at hand is one which would provide us with the probability of [any given number] of heads in tossing a coin not only twice but any number of times. ([We will not al]ways be dealing with oversimplified [examples, but will] utilize larger samples.) Such a theorem [would state the] probability of getting exactly r heads in N tosses of a [coin], where N refers to the number of tosses and r is any number from 0 to N. [Such a] theorem (the binomial theorem) does exist, and it may be stated in the following form, which has been especially adapted to our example.

BINOMIAL DISTRIBUTION

p(r heads) = N!/[r!(N - r)!] × p(H)^r p(T)^(N-r)

Here p(r heads) is the probability of r heads, N!/[r!(N - r)!] is the number of ways of getting r heads, and p(H)^r p(T)^(N-r) is the probability of one of the ways of getting r heads.

N! should be read factorial N, and it is defined as (N)(N - 1)(N - 2) . . . (1). Similarly, r! is equal to (r)(r - 1)(r - 2) . . . (1), and the same holds true for (N - r)!. Also, it should be noted that 0! is defined as equal to 1.
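The formula translates almost symbol for symbol into a short function. This is a minimal sketch rather than a definitive implementation; the function name binomial_probability and its argument names are assumptions introduced for the illustration.

```python
from math import factorial

def binomial_probability(r, n, p_heads):
    """Probability of exactly r heads in n independent tosses of a coin."""
    # Number of ways of getting r heads: N!/[r!(N - r)!]
    ways = factorial(n) // (factorial(r) * factorial(n - r))
    # Probability of one of those ways: p(H)^r p(T)^(N-r)
    one_way = p_heads ** r * (1 - p_heads) ** (n - r)
    return ways * one_way

# Two tosses of a fair coin: two heads, one heads, and no heads.
for r in (2, 1, 0):
    print(r, binomial_probability(r, 2, 0.50))
```

For two tosses of a fair coin this gives .25, .50, and .25, the values obtained earlier from the multiplication and addition rules.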
Let us first test this theorem on the calculations previously made for the probability of two heads, one head, and no heads in two tosses of a coin, taking the calculation of p(2 heads) first. Here, N = 2, while r = 2. Substituting in the formula, we have the following for the number of ways of getting two heads:

N!/[r!(N - r)!] = 2!/[2!(2 - 2)!] = 2!/(2!0!) = (2)(1)/[(2)(1)(1)] = 1.

Thus, according to the calculation, there is only one way in which two heads may be obtained in two tosses of a given coin (and this checks out with the previous example): HH, or heads on the first toss and heads on the second. (Of course, all of this may seem exceedingly trivial, but the results will not be as obvious when we move on to a more complicated example.) The probability of this one way of getting two heads is calculated as follows:

p(H)^r p(T)^(N-r) = (.50)^2 (.50)^(2-2) = .25(.50)^0.

Because any number taken to the power of 0 is equal to 1, we have .25(.50)^0 = .25(1) = .25. Thus, substituting these calculated values in the equation:
equation: 





alternatives. Let us assume that the investigator starts with the hypothesis that the coin is fair, i.e., that p(H) = .50. This starting hypothesis, which is set up for purposes of testing, is called the null hypothesis and is symbolized by H0. An alternative is the hypothesis that the coin is not fair. This is [called the] alternative hypothesis and is symbolized as H1. This terminology is used as follows to state the two alternatives involved:

H0: p(H) = .50.

H1: p(H) ≠ .50.

The investigator has two alternatives: he may decide that the coin is fair, i.e., accept H0 (the null hypothesis); or he may decide that the coin is not fair, i.e., reject H0 (the null hypothesis). His decision may [or may not be] correct as to the actual state of nature. Thus, as may be seen in [Table 14-1], the two correct decisions are (1) that the coin is fair in the situation where the coin is actually fair, and (2) that the coin is not fair [in the situation] where the coin actually is not fair. These may be stated in [more general terms as] (1) accepting a true null hypothesis, and (2) rejecting a [false null hypothesis]. In either event, a decision is made in favor of either the null [or] the alternative hypothesis. Acceptance of the null hypothesis [implies a] decision in favor of this hypothesis; rejection implies a decision in [favor of the] alternative hypothesis, for the only logical alternative to a decision that the coin is fair is the decision that the coin is not fair.

Any given decision made by the scientist may not be [correct, for he is] always uncertain as to the true state of nature. For example, [if a coin] were tossed a million times and it came up heads on [every toss], there is still a finite probability that the coin is a fair one. [This may be] calculated by extending the multiplication rule to more [than two] events:

p(1,000,000 heads) = (.50)(.50)(.50) . . . = (.50)^1,000,000

Naturally, this probability is very close to 0, but it [is not equal to 0. Thus,] even in the face of overwhelming [evidence that a] decision to accept H0 is correct, an investigator may in fact [be in error in making] this decision. Acceptance of a null hypothesis which is in [fact in]correct is designated as Type II error. Rejection of a null hypothesis that is in fact correct is designated as Type I error.
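The size of the probability of a million consecutive heads from a fair coin can be made concrete. The sketch below is an illustration under the assumption of a fair coin and independent tosses; the variable names are hypothetical. Because the number is too small for ordinary floating-point arithmetic, the multiplication rule is applied on a logarithmic scale.

```python
import math

tosses = 1_000_000
p_heads = 0.50  # assumption: the coin is fair

# The direct product is too small for a float and evaluates to 0.0.
direct = p_heads ** tosses

# On a log scale the multiplication rule becomes repeated addition.
log10_probability = tosses * math.log10(p_heads)

print(direct)             # 0.0 (floating-point underflow, not a true zero)
print(log10_probability)  # roughly -301030
```

The probability is therefore on the order of 10 raised to the power -301,030: extremely small, yet still greater than 0.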

Thus, the two possible correct decisions are (1) acceptance of a true null hypothesis, and (2) rejection of a false null hypothesis. The possible incorrect decisions are (1) rejection of a true null hypothesis (Type I error),

* [...] hypothesis, e.g., "the coin is fair." Rather, [one may] state that "the null hypothesis is not rejected" [instead of] "accepted" [...]. Such statements are more conservative [and are mor]e in accord with the scientist's general lack of certainty as to the truth of his conclusions.




To complete the picture, it is necessary [to show that such theorems are] valuable for the scientific process, as the theorems of plane geometry are valuable for the calculation of spatial relationships. [Such theorems are of] aid to the scientist in the decision-making process. [Scientists continu]ally make decisions about the nature of [reality, and these] decisions are based on a limited amount of data. Theorems such [as this one, by] quantifying various aspects of the decision process, enable the [scientist to make] maximum use of his data and, at the same time, to communicate effectively [to] other scientists the basis for his decision.

14.3 STATISTICS AND THE DECISIONS OF THE SCIENTIST

Decisions, States of Nature, and Types of Error

In order to illustrate the way in which [probability theory and mathe]matical statistics can aid the scientist in his decision-making process, it is useful to begin with a very simple example: the tossing of [a coin. It is within] this context that the basic principles of statistical decision making will be presented. (More realistic examples are provided in Chapter [...].)

The problem with which we begin is one in which a [decision is to be made] on the basis of a number of tosses as to whether or [not a given coin is a fair] one, fair being defined to be the situation where p(H) = .50. There are thus two possible states of nature or reality under consideration: [the coin is fair] and the coin is not fair (the latter designated as p(H) ≠ .50). [There are also] two alternative decisions from which a choice is to be made: [the coin is fair] and the coin is not fair. We may schematize this decision situation as follows:


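TABLE 14-1

ALTERNATIVE DECISIONS AND ALTERNATIVE STATES OF NATURE

                                         States of Nature
Decisions                        Coin is Fair          Coin is Not Fair
                                 p(H) = .50            p(H) ≠ .50

Coin is Fair, p(H) = .50         [Correct decision]    [Incorrect decision]
Coin is Not Fair, p(H) ≠ .50     [Incorrect decision]  [Correct decision]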




TABLE 14-2

PROBABILITY DISTRIBUTION OF NUMBER OF HEADS FOR SIX TOSSES OF A COIN UNDER THE ASSUMPTION THAT p(H) = .50

Number of Heads    Probability
0                  1/64 or .0156
1                  6/64 or .0937
2                  15/64 or .2344
3                  20/64 or .3125
4                  15/64 or .2344
5                  6/64 or .0937
6                  1/64 or .0156
Total              64/64 or 1.000
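The entries of such a table can be regenerated from the binomial formula. The sketch below is illustrative only; the function name heads_distribution is an assumption, and exact fractions are used so that the entries can be compared with the 64ths shown in the table. Note that 6/64 is exactly .09375, which the table prints as .0937.

```python
from fractions import Fraction
from math import comb

def heads_distribution(tosses, p_heads):
    """Return [p(0 heads), p(1 heads), ..., p(tosses heads)]."""
    p_tails = 1 - p_heads
    return [comb(tosses, r) * p_heads ** r * p_tails ** (tosses - r)
            for r in range(tosses + 1)]

# Null hypothesis: the coin is fair, p(H) = 1/2.
fair = heads_distribution(6, Fraction(1, 2))
for r, p in enumerate(fair):
    # Fractions are stored in lowest terms; rescale them to 64ths for display.
    print(r, f"{p.numerator * (64 // p.denominator)}/64", float(p))
print("Total", sum(fair))
```

The same function accepts any other value of p(H), which is what is needed when the null hypothesis is assumed to be false.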


Level of Significance and Region of Rejection

These probabilities for each of the possible events, on the [assumption that] the null hypothesis is correct (the coin is fair), constitute v[aluable informa]tion for the investigator in making a wise decision on the [basis of] limited data. In particular, these probabilities help the [investigator to set up a rule] for deciding whether or not the coin is fair for each of [the possible] events or occurrences. Let us suppose, for example, that the investigator [wishes] there to be no more than a probability of .05 that he [will decide that the] coin is biased when it is in reality fair. In such a situation, he [would be reject]ing a true null hypothesis, or making a Type I error. The probability of making a Type I error is called the level of significance of [the statistical] procedure and is symbolized by α. In this situation, then, one possible rule for decision making is to set α, or the probability of making a Type I error, [at] no more than .05.

Once this rule is set up, we may follow through its [implications as to] which events or occurrences should lead the investigator to [decide that the coin] is unfair and thus reject the null hypothesis. After all, [the value of rules] for decision making is that they can [provide a decision for the investi]gator for any given set of results. Consequently, the next step [in the decision-making] procedure is to establish a region of rejection, i.e., [those events which will] lead the investigator to decide that the coin is biased. [The region] of rejection could have arbitrarily been established in the [first place] without the benefit of any statistical [analysis. The investigator might have] set up a rule that if 0, 1, 5, or 6 heads occur, he [will] decide that the coin is biased. Without the statistics, however, he will [not know] the probability that the decision will be incorrect. [Statistics] will aid him in making an accurate estimate of the probability of Type I error, [and] this




and (2) acceptance of a false null hypothesis (Type II error). [Thus, rejection of] a null hypothesis constitutes either a Type I error or a valid [decision, and] acceptance of a null hypothesis constitutes either a [Type II error or a valid] decision. In either case, the investigator is never in a position to determine for certain whether or not he is in error.

Probability Distribution Under the Null Hypothesis

Having outlined the relationship between the decisions [of the scientist and] the states of nature, we may turn to the data which [he uses] for making his decision. It is here that statistics plays a [role in helping the] scientist to make wise decisions in the face of uncertainty. [We shall] focus in particular on the binomial theorem discussed in [Section 14.2 and] attempt to show how it enables the scientist to utilize his data [through the] application of the mathematical formulations in probability theory. [Let us] assume (and this can be checked) that the axioms of p[robability theory hold] for the coin-tossing situation, and we also interpret the p[robability] according to the frequency interpretation. Then this theorem [may be] used to make certain predictions on the basis of the limited data [at] hand, and these predictions are utilized to make the fundamental scientific [decisions].

Let us assume that the data on which the investigator [bases his] decision as to whether or not the given coin is fair consist only of [the] results of six tosses of the coin. An important aspect of these data is the number [of times] heads comes up in the six tosses; the particular order of occurrence [is] irrelevant to the purposes at hand. Before considering [the results that might] emerge from a given series of six tosses, let us analyze each of [the] different possibilities. There are seven possible occurrences or events: the number of heads might be 0, 1, 2, 3, 4, 5, or 6. Because the null hypothesis, [the] hypothesis actually tested, is that p(H) = .50 or 1/2, we begin with [the] assumption that this is the true state of nature. We may subsequently [make] the decision that the initial assumption was correct, or we may decide [that it] was incorrect. In either case, our decision may either be correct or [incorrect] with respect to the actual state of nature.

Having assumed that p(H) = .50 or 1/2, we may proceed to calculate the probability of each of the possible events by means of the binomial theorem. For example, the calculation for p(0 heads) is

p(0 heads) = [6!/(0!6!)] × (1/2)^0 (1/2)^6 = 1 × (1/64) = 1/64 or .0156.

Similarly, the probabilities of 1, 2, 3, 4, 5, and 6 heads may be calculated under the assumption that the coin is fair, i.e., under the null hypothesis that p(H) = .50. The results of these calculations are listed in Table 14-2.*

* I am indebted to Prof. Elizabeth Shuhany of the Mathematics Department of Boston University for many of the techniques of presentation utilized in the remainder of this chapter.




significance in advance. By proceeding in this manner, the investigator leaves his hypothesis open to acceptance or rejection. If, on the other hand, the investigator examined the data before setting the level of significance, he might be tempted to adjust the level so as to reach the conclusions which he is interested in reaching.

There is a perfectly legitimate way to avoid setting up levels of significance in advance, and this involves viewing the analysis as exploratory, rather than as a test of an hypothesis. In this way the investigator may freely proceed to examine the relationships among the variables in his study. He may calculate for any given null hypothesis which might have been set up the exact probability of rejecting it if it were actually true (α).

For example, let us suppose that 0 heads occur in six tosses of a coin and that the null hypothesis, which might have been set up in advance ([but was] not), is that the coin is fair, i.e., p(H) = .50. Then we might simply indicate that the probability of obtaining 0 heads in 6 tosses of a fair coin [is .0156], without using this information to either definitely accept or reject the null hypothesis. Utilizing this procedure, the investigator might [perform] such calculations for all those relationships that seemed, on [the basis of the] data, to be significant.

14.4 RELATIONSHIP BETWEEN TYPE I AND TYPE II ERRORS

In his use of statistical procedures for testing hypotheses, [the] investigator [must] make certain decisions before he can perform [the test. One] of these is his setting of the level of significance, or the probability of making a Type I error. However, such a decision is [best made with a consider]ation of the loss or disadvantage incurred by making a Type I [error as compared] to a Type II error (accepting a false null hypothesis). [This is] because for a given size sample the probability of a Type I [error varies inversely] with the probability of a Type II error. In this section the interrelationship between these two types of error will be further explored. The aim is to [provide] the student not with a detailed manual of statistical procedure, [but with] a more thorough understanding of the role of statistics in the social scientist's decision-making process.

The "Law of the Conservation of Uncertainty"

Thus far we have illustrated a type [of decision procedure which sets a] limit to the probability of Type I error. There is, however, [a second type] of error with which the investigator should be concerned: [the] decision that the coin is fair when it is really biased. The probability [of Type] II error is designated by [β].



[...] knowledge might [...] effective for his purposes. In addition, it would [enable him] to interpret and weigh properly the results of the experiment.

One-sided and Two-sided Tests

With a rule specifying the probability of Type I error (the level of significance), [a] determination is [made as to] whether the null hypothesis [is to be rejected for results] of 0 or 6 heads. This conclusion is reached by [examining the] extremes of the probability distribution [...]. A two-sided test or decision procedure might [...] 0 or 6 heads. The alternative hypothesis [H1: p(H) ≠ .50] automatically implies a two-sided test, for such an inequality may [hold] if p(H) is greater than .50 and if p(H) is less than .50. With a two-sided test, [the rejection region which satisfies the] criterion is one which consists of 0 and 6 heads. The probability of either 0 or 6 heads, on the assumption that p(H) = .50, i.e., that the coin is fair, is .0156 + .0156, or .0312. With a rejection region of 0 and 6 heads, the probability that the investigator will reject the null hypothesis (decide that the coin is biased when it is really a fair coin) is [.0312. This] satisfies the initial criterion that the probability of Type I error be [no more] than .05. It also gives the investigator a rule for decision making constructed on the basis of his desire to limit the probability of Type I error to a certain level. With such a rule, the investigator can arrive at a decision as to whether or not the coin is fair for any possible occurrence of events with the knowledge that only a very small chance exists that he will make a Type I error.

In a variation of this example, the investigator might be concerned about whether or not the coin is biased in a particular direction, e.g., in favor of heads over tails. In such a situation, it would be most appropriate to set up a one-sided test or decision procedure. If we retain the desire of the investigator to limit the probability [of Type I error] to .05, then the appropriate rejection region consists of 6 heads; α = .0156. The rejection region does not include 0 heads because, in this one-sided test, the investigator is interested only in whether or not p(H) is greater than .50. It is of no concern for him to distinguish the situation where p(H) = .50 from the situation where p(H) is less than .50.

Timing of the Decision

It should be noted that, in order to perform a statistical test, it is conventional to decide on a one-sided or two-sided test and the level of




the conservative case, 5 heads were in the region for [accepting the null] hypothesis that the coin is fair. The same would hold for the [situation where] the coin is actually biased against heads coming up. [With α = .2186,] "1 heads" is in the rejection region, while it is not for α = [.0312].

We have tried to show on an intuitive basis that α and β vary [inversely. We] might think of this as a kind of "law of the conservation of uncertainty," analogous to the physical law having to do with the conservation [...]. What it refers to is that, for a given set of data and a given [method of calculating] the probabilities of events, an increase in the probability of [Type I error] serves to decrease the probability of Type II error, and vice versa. [If] an investigator decides to shift from an .05 level of significance [to a lower level], he pays a price for this reduction in the probability of Type I error (α): the price is that the probability of Type II error (β) will increase.
The Calculation of Type II Error

We may now illustrate this relationship between α [and β by means of] calculations by comparing the decision situation [where α = .0312 with] that where α = .2186. In order to calculate β for each of these [situations, it] is necessary to make the assumption that p(H) ≠ .50. If we [are to calculate the] probability of accepting a false null hypothesis, we [must deal with a] situation where the null hypothesis is in fact false, i.e., [p(H) ≠ .50. Once this] assumption is made, we may proceed to list [the probabilities for] each of the possible number of heads that might occur, [as was] done with the previous assumption that p(H) = .50.

But we cannot calculate these probabilities unless we [have more] specific information. To say that p(H) ≠ .50 leaves almost completely [open the ques]tion of what p(H) actually equals. It may be equal to [...] or just about any other decimal figure. In our previous [calculation of the prob]ability distribution for the various numbers of heads, [we] utilized the information that p(H) = .50. Without that, we could not have used [the binomial] theorem for calculating probabilities, for p(H) is one [of the terms in the] formula. Consequently, we must make a [specific assumption as to the] value of p(H) in the present instance. It can be any [value] different from .50, i.e., any probability for which the [null hypothesis] is actually false. Let us assume that p(H) = .10 [and] proceed to illustrate one of the calculations that will be utilized [to provide] the data for a table of the probability distribution for numbers of heads [in six] tosses of a coin, under the assumption that p(H) = .10.

p(0 heads) = [6!/(0!6!)] × p(H)^0 p(T)^6 = 1 × (.10)^0 (.90)^6 = (.90)(.90)(.90)(.90)(.90)(.90) = .5314.




Let us continue with the [coin-tossing example, setting] up a null hypothesis that the coin is fair and [limiting] the probability of Type I error to .05, i.e., α ≤ .05, [with] the null hypothesis being p(H) = .50 [and the alternative] hypothesis p(H) ≠ .50, and let the data consist of the number of heads resulting from six tosses as before. One question that might [be raised concerns the] effect on the probability of Type II error (β) [of setting the] probability of Type I error (α) at a given level. Intuitively, it would [seem that the] investigator must pay a price, in terms of β, for setting up an [extremely small α]; in other words, that α and β vary inversely. Let us compare [a rejection region] of 0, 6 [with] that of 0, 1, 5, 6. With the smaller rejection region, α = .0312, while [with] the larger one, α = .2186. With the smaller α the [investigator is being] more conservative and will be very cautious [about rejecting the null hypothesis]. The only results in the rejection region [are 0 and 6] heads, but the null hypothesis that the coin is fair will be accepted for [any other] result. In other words, even if as many as five heads or as few as one heads result, the investigator would accept the null hypothesis that the [coin is fair].

But could it not occur very easily that the coin is [actually biased] in favor of heads and comes up heads five times in six [tosses? Yet] in the case of the smaller α the investigator would make the decision to accept the null hypothesis that the coin is fair in this [situation, because] "5 heads" does not lie in the rejection region. By [making such] a decision, the investigator is making a Type II error, i.e., accepting a [false null] hypothesis by deciding that the coin is fair when it is actually biased. [Similarly,] in the case of the smaller α, if the coin were actually biased against [heads and heads] came up once in 6 tosses, the investigator would decide [that] the coin is fair when it is actually biased and thus commit a Type [II error].

The point being made here is that the investigator who is [conservative with] respect to Type I error pays a price in terms of Type II error. [In seeking to] avoid a decision that the coin is biased when it is really fair, he [increases the] probability of his deciding that the coin is fair when it is actually [biased. This] can be further illustrated if we consider the decision situation relating [to the] larger α, where the rejection region includes 0, 1, 5, and 6 heads. [Here, the] probability of Type I error (α) is .2186, which is much greater [than in the] situation where α = .0312 as in the previous example. If an inverse relationship between α and β exists, we would expect to find a much smaller β [with] α = .2186 than with α = .0312.
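The two levels of significance compared here can be recomputed from the rejection regions themselves. The following is a sketch only; the function name alpha and its arguments are assumptions made for the illustration. It works with exact fractions, which give 2/64 = .03125 and 14/64 = .21875. The figures .0312 and .2186 used in the text are what one obtains by adding the four-place entries of Table 14-2.

```python
from fractions import Fraction
from math import comb

def alpha(rejection_region, tosses=6):
    """Probability that a fair coin falls in the rejection region,
    i.e., the probability of a Type I error for that decision rule."""
    half = Fraction(1, 2)
    return sum(comb(tosses, r) * half ** tosses for r in rejection_region)

small_region = alpha({0, 6})        # the more conservative rule
large_region = alpha({0, 1, 5, 6})  # the less conservative rule
print(float(small_region), float(large_region))
```

Only the first rule keeps the probability of Type I error under the .05 limit discussed earlier.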

This is in fact the case. With α = .2186, the rejection region [consists of] 0, 1, 5, and 6 heads. If the coin were biased in favor of heads, we would be more likely to detect this if our decision procedure is to decide that the [coin] is biased if as many as 5 heads occur than if we were more conservative. [In]




[assumptions] for p(H). In other words, β [may be calculated in many] different ways because a biased coin may have [any one of a number of values for p(H)]. [Thus,] no general formula has been developed for calculating β [...]. The calculation is different for each different assumption we make as to p(H).

Setting the Level of Significance

Thus far we have discussed the way in which [rules for decision making] can be developed on the basis of limiting α to [a given level,] as well as the way in which α and β are related to one [another. Little], however, has been said about another, more general [decision]: the decision as to which level to set for α and, consequently, β. [The] crucial question for the investigator in setting α has to do with the relative [importance he] attaches to making a Type I and a Type II error. [Let us suppose, for] example, that his decision as to whether or not the [coin is fair will lead] to his entering or not entering a gambling game [with another] individual where he will bet $1.00 on each throw of the coin and where [he will] win or lose depending on whether the other individual guesses correctly [how the coin] turns up. In such a situation, unless the [investigator is] quite wealthy, it will be very important [for him to avoid a] Type II error, i.e., deciding that the coin is fair when it is actually biased. If he makes a Type II error and decides that the coin is [fair, he will enter the] gambling game and will probably lose his shirt. It may [not be as im]portant for the investigator to avoid making a Type I error, i.e., deciding [that the coin is] biased when it is really fair. In such a [situation, setting α equal to] .2186 would seem to be a wiser strategy than setting α equal to .0312. [This] is because, on the assumption that p(H) = .10, for example, β = [.1142 in the] former case while β = .4686 in the latter. Thus, the [probability that the] investigator will decide to enter the gambling game when the coin is heavily biased against heads is much greater for α = .0312 than for α = .2186.

For another type of individual, incffvidual who enjoys 

able. Suppose, for example, that we have -.ffending the other player 

coin-tossing games and who would be afrai selection of « = -0312 

by refusing to enUr the game. In such a ^-ould be more con- 

makes more sense than a = .2186, because ^ biased when it 

cemed about making a Type I error (decidmg ^ deciding to 

•is actually fair) than a Type H error. The latter wodd Uad t ^ 
enu, toe gctoe wito n blL^ coin and toe .= -0312 

constitute a relatively small loss for a weal _y)P. . . cares more about 

strategy would be a better choice for an mdividual wno 
making Type I error than Type ^ tol decision making. For eaam- 

Other considerations are involved m slatis _•. « associated not 

pie, toe invesrigator woold do weU to^e ““ IpW = •>»! 

only with one particular assumption about m 2^1 



The entire table follows. 

TABLE 14-3

Number of Heads      Probability

       0                .5314
       1                .3543
       2                .0984
       3                .0146
       4                .0012
       5                .0001
       6                .0000


Let us assume that we still have the same null hypothesis, that the coin is
fair. We may begin with the calculation of β for the α = .0312 situation.
Here, the rejection region consists of 0 and 6, and the acceptance region
consists of 1, 2, 3, 4, and 5. It is the acceptance region which is of
interest here because we are concerned with β, the probability of accepting
a false null hypothesis.

If the state of nature is that p(H) = .10, then the probabilities for the
occurrence of 1, 2, 3, 4, and 5 are (from Table 14-3) .3543, .0984, .0146,
.0012, and .0001, respectively. The probability of accepting the null
hypothesis is the probability of either 1, 2, 3, 4, or 5 heads. Thus, the
addition rule applies, and the sum of these probabilities is .4686. In other
words, β = .4686.

Now we may calculate β for the α = .2186 situation and compare it with the
β just calculated. For α = .2186, the rejection region consists of 0, 1, 5,
and 6, and the acceptance region consists of 2, 3, and 4 heads. The
probabilities corresponding to the events in the acceptance region are
.0984, .0146, and .0012 (from Table 14-3). Because the probability of
accepting the null hypothesis (β) is equal to the sum of these
probabilities, β = .1142.

In comparing the α = .0312 and the α = .2186 decision situations, we see
that the β's which correspond to them are .4686 and .1142, respectively.
These calculations show that a shift from a smaller α to a larger α involves
a shift from a larger β to a smaller β, and vice versa. Thus, we have an
illustration of what may be facetiously called the law of the conservation
of uncertainty. It should be noted that the "law" does not state that the
total probability of error (α + β) remains constant. In the illustration
where α = .0312, α + β = .4998. In the situation where α = .2186,
α + β = .3328.
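The two values of β can be reproduced directly from the binomial distribution. This is a minimal sketch, assuming six tosses and a true state of nature p(H) = .10 as in Table 14-3; the function names `binomial_pmf` and `beta` are illustrative only.

```python
from math import comb

def binomial_pmf(k, n, p):
    """Probability of exactly k heads in n independent tosses when p(H) = p."""
    return comb(n, k) * p**k * (1 - p)**(n - k)

def beta(acceptance_region, p_true, n=6):
    """Probability of Type II error: the chance that a coin with
    p(H) = p_true produces a result in the acceptance region."""
    return sum(binomial_pmf(k, n, p_true) for k in acceptance_region)

# Acceptance regions for the two decision procedures discussed in the text.
beta_for_small_alpha = beta({1, 2, 3, 4, 5}, 0.10)   # rejection region 0, 6
beta_for_large_alpha = beta({2, 3, 4}, 0.10)         # rejection region 0, 1, 5, 6

print(round(beta_for_small_alpha, 4), round(beta_for_large_alpha, 4))
```

Changing the second argument of `beta` shows how β depends on the particular value assumed for p(H).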

Actually, what happens to β when α is changed depends on what value we





Sampling, Statistical Testing, 
and Degree of Association 


15.1 PROBABILITY SAMPLING 

The Nature of Probability Sampling

Certain aspects of sampling procedures were taken up within the context
of the collection of data (see Section 4.2). Probability sampling was
referred to as a method for selecting a subset of units from a larger
universe (population) so that each unit or subset of units of the population
has a known probability of being included in the subset or sample. This may
be illustrated in the context of the coin-tossing example.

We may designate as the unit to be sampled the number of heads resulting
from six tosses of a given coin. The population may be conceived of as an
infinite number of such units, with these data resulting from an infinite
number of sets of six tosses of the coin. Let us consider whether the number
of heads resulting from one set of six tosses may be considered to be a
probability sample. The criterion here has to do with whether or not each
possible event (i.e., each number of heads from 0 to 6) has a known
probability of occurring. Table 14-2 lists these probabilities on the
assumption that the coin is a fair one. As explained in Chapter 14, they
were calculated by means of a theorem based on the multiplication and
addition rules or axioms. Also, p(H) was interpreted as the relative
frequency of heads in an infinite number of tosses. On the assumption that
these axioms apply in the coin-tossing situation, each number of heads has a
known probability of occurring.

But how do we know whether or not the multiplication and addition rules
or axioms actually apply in the coin-tossing situation? This can be
indirectly tested by a very large number of sets of 6 tosses. For example,
the calculated probability of 0 heads resulting from 6 tosses, using the
multiplication and addition rules, is .0156 (from Table 14-2). In 1000 sets
of 6 tosses, let us suppose that we actually obtain 14 situations where all
6 tosses




but the β's that would correspond to many other assumptions.
In addition, there are generally many ways of arriving at a probability
distribution for a given research situation. In addition, there is the
question as to whether or not additional data should be collected before
making any decisions. Each of the choices affects every other one, and the
investigator must somehow take all of them into account. The difficulty of
handling so complex a set of factors is obvious, but statistical decision
procedures provide the means of making decisions that are pragmatically
oriented to the achievement of these goals.

SELECTED REFERENCES FOR FURTHER STUDY 

Herman Chernoff and Lincoln E. Moses, Elementary Decision Theory (New
York: John Wiley and Sons, 1959). Decision theory is used as a basis for
clarifying some fundamental principles of statistics. This book is
appropriate for students interested in going beyond the use of statistics as
a subsidiary tool for data analysis to its use in mathematical models of
behavior or in the development of strategies and tactics of scientific
investigation.

Darrell Huff, How to Lie with Statistics (New York: W. W. Norton, 1954).
This non-technical book surveys many different types of fallacies associated
with inappropriate uses of statistical reasoning. According to the author:
"The crooks already know these tricks; honest men must learn them in
self-defense."

W. Allen Wallis and Harry V. Roberts, Statistics: A New Approach (New
York: The Free Press of Glencoe, 1956). The "new approach" is
the strategy of statistical decision making; see especially Chapter ' J^g^pty 
much readable background information about slatistica in this m ^ 

textbook The fact that statistical inference is almost completely 
consideration of the normal distribution makes the topic especia y 
hensible for the reader who is new to statistics. 




Size of sample

Relative frequency of Republicans in the sample
(.5300 = actual proportion of Republicans)

Here, as in the coin-tossing situation, successively larger samples yield
relative frequencies that are closer and closer to a given probability. In
the coin-tossing situation this was the calculated probability of 0 heads in
6 tosses, while in this example it is the actual proportion of Republicans
in the electorate.

These hypothetical data also illustrate the efficiency of probability
sampling techniques. A sample of 2000 from a total electorate of one million
constitutes only two-tenths of 1 percent. Yet such a small sample can
provide data which very closely approximate what would be obtained if a
census of the entire electorate were taken.
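The behavior of samples of different sizes can be imitated by a small simulation. The sketch below rests on several assumptions that are not in the text: voters are represented by the integers 0 to 999,999, the first 530,000 are treated as Republican, the sample sizes 100 and 1000 are chosen only for illustration alongside the 2000 mentioned above, and the random seed is arbitrary.

```python
import random

POPULATION_SIZE = 1_000_000
REPUBLICANS = 530_000   # voters numbered below this value are treated as Republican

def sample_republican_share(sample_size, seed=0):
    """Draw a simple random sample without replacement and return
    the proportion of Republicans found in it."""
    rng = random.Random(seed)
    chosen = rng.sample(range(POPULATION_SIZE), sample_size)
    return sum(voter < REPUBLICANS for voter in chosen) / sample_size

for size in (100, 1000, 2000):
    print(size, round(sample_republican_share(size), 4))

sampling_fraction = 2000 / POPULATION_SIZE   # two-tenths of 1 percent
```

Different seeds give different sample proportions, but with 2000 respondents they will rarely stray far from .53.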

The general utility of probability sampling techniques to the social
scientist stems from his desire to provide evidence for propositions of wide
scope on the basis of limited numbers of individuals. He would like to
develop propositions that hold for all human beings. Given limited
resources, probability sampling techniques enable him to stretch these
resources enormously. He is able to draw conclusions about the behavior of
large populations (for example, the electorate of 1 million individuals) on
the basis of data obtained from relatively small samples (for example, 2000
individuals).

Some Types of Probability Samples

SIMPLE RANDOM SAMPLES

This type of sample may be illustrated by means of a lottery. Let us
imagine that each individual in the population from which the sample is to
be drawn is represented by a slip of paper. The slips are then very
thoroughly mixed in a bin. Then slips are drawn one at a time from the bin,
with thorough mixing taking place between draws. In this way a simple random
sample of individuals (represented by the slips) is drawn from the
population.

Social scientists do not, of course, actually obtain simple random samples
in this manner. A typical technique is to list all of the individuals in
the population and to assign a unique number to each individual. Then they
select as many individuals as they desire by obtaining numbers from a table
of random numbers and using these numbers to designate the individuals in
their sample. These random numbers are generally presented in a format such
as the following:*

* Interstate Commerce Commission, Table of 105,000 Random Decimal Digits
(Washington: 1949), Statement No. 4914, File No. 261-A-1. Tables of random
numbers may be found in most standard textbooks on statistics.



come up tails. This would constitute a relative frequency for the 0 heads in
6 tosses situation of 14/1000 or .0140. Let us now suppose that we toss the
coin 9000 additional sets of 6 tosses, or a total of 10,000 sets, and end
up with a total of 162 situations where all 6 tosses come up tails; here
the relative frequency would be 162/10,000. Finally, let us suppose
that we obtained a relative frequency of .0157 for 30,000 sets of 6 tosses.
These hypothetical relative frequencies tend to converge on the calculated
value of the probability of zero heads in 6 tosses (.0156) as the number of
sets of 6 tosses increases:


Number of sets of 6 tosses        1000      10,000     30,000

Relative frequency of
0 heads in a set                  .0140     .0163      .0157



If, indeed, data indicated that the larger the sample, the closer the
relative frequencies were to the calculated probability, this would
constitute evidence in support of the idea that the multiplication and
addition rules or axioms actually apply to the coin-tossing situation. In
other words, there would be evidence that the interpretation of the
mathematical concept of probability as corresponding to the relative
frequency of the occurrence of heads is appropriate in
a coin-tossing situation.
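The kind of indirect test described here can be imitated with simulated tosses. The following is a sketch only: it assumes that a pseudo-random number generator is an acceptable stand-in for a fair coin, and the function name and the seed are illustrative choices rather than anything taken from the text.

```python
import random

def relative_frequency_of_zero_heads(num_sets, seed=0):
    """Simulate num_sets sets of 6 tosses of a fair coin and return the
    proportion of sets in which no heads occur."""
    rng = random.Random(seed)
    zero_head_sets = 0
    for _ in range(num_sets):
        heads = sum(rng.random() < 0.5 for _ in range(6))
        if heads == 0:
            zero_head_sets += 1
    return zero_head_sets / num_sets

calculated_probability = 0.5 ** 6   # .0156 after rounding, as in Table 14-2

for num_sets in (1000, 10_000, 30_000):
    print(num_sets, round(relative_frequency_of_zero_heads(num_sets), 4))
```

With other seeds the individual figures change, but for large numbers of sets they tend to lie close to the calculated probability.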

Although the coin-tossing situation can aid in portraying the nature of
probability sampling, it is difficult to understand its importance to the
social scientist unless we talk about the sampling of human populations. For
the social scientist engaging in surveys, for example, probability-sampling
techniques enable him to select a sample in such a way that it will in
general be highly representative of the larger population in which he is
interested, with the degree of representativeness depending on the size of
the sample. If he desires to predict, for example, the vote in a given
municipal election, such techniques will aid him to do this and still
interview only a limited number of respondents constituting only a small
proportion of the electorate.

To illustrate this, let us assume that the electorate consists of a total of
1 million individuals, and that 530,000 (or 53 percent) plan to vote for the
Republican candidate and 470,000 (or 47 percent) for the Democratic
candidate. By using probability sampling techniques the investigator will
generally be able to avoid the selection of a sample that is overrepresented
by Republican voters or by Democratic voters. Also, the larger the sample,
the greater will be the likelihood that the percentage of Republican voters
in the sample will be the same as this percentage for the entire
municipality. For example, data for different sizes of samples might look
like this:

* The prediction of election returns is not a simple task, due to such
factors as a shift in voting preference subsequent to the survey. Some of
the difficulties involved are outlined in Paul F. Lazarsfeld, Bernard
Berelson, and Hazel Gaudet, The People's Choice (New York: Columbia
University Press, 1948).





Suppose, for example, that we wish to obtain a simple random sample of
10 students from a total class of 94 students. We could proceed by listing
all of the students and assigning to each a number from 1 to 94. The
students assigned 1-digit numbers (1, 2, 3, 4, 5, 6, 7, 8, or 9) would have
their numbers recorded with a 0 preceding them (01, 02, 03, 04, 05, 06, 07,
08, or 09) so as to utilize the table of random numbers more easily. The
table could then be entered at any point, e.g., the sixth row. From there
the researcher could proceed horizontally from left to right (82854 55846
18076 12415) to the end of the row. He could then start the next row and
proceed from left to right (61078 52433 . . .), etc. The numbers could be
recorded in pairs, i.e., 82, 85, 45, 58, 46, 18, 07, 61, 24, 15. These ten
numbers could then be used to designate the desired sample of ten
individuals.
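The reading of pairs from the table can be written out as a short routine. This is a sketch under stated assumptions: the digits are those of the row reconstructed from the pairs listed above, and the rule of skipping 00, numbers above the class size, and numbers already drawn is a common convention that the text does not spell out. The function name is illustrative.

```python
def select_from_digit_table(digits, population_size, sample_size):
    """Read two-digit numbers from left to right and keep those that
    designate a member of the population and have not yet been drawn."""
    selected = []
    for start in range(0, len(digits) - 1, 2):
        number = int(digits[start:start + 2])
        # Assumed convention: skip 00, out-of-range numbers, and repeats.
        if 1 <= number <= population_size and number not in selected:
            selected.append(number)
        if len(selected) == sample_size:
            break
    return selected

row_digits = "82854 55846 18076 12415".replace(" ", "")
students = select_from_digit_table(row_digits, population_size=94, sample_size=10)
print(students)
```

In this particular row every pair happens to be usable, so the skipping rule is never invoked; with another starting point additional digits might have to be read before ten students are obtained.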

The student may well wonder how such tables of random numbers are
constructed. The general procedure is to locate a mechanism that can be used
to generate numbers in such a way that each of the ten numerals has the same
probability of appearing, and each sequence of numerals (e.g., each pair)
has the same probability of occurrence. The numerals generated are then
tested, using their relative frequencies as an interpretation of these
probabilities, and a decision is made as to whether or not they constitute a
set of random numbers. Each of the above random digits, for example, was
produced by summing 10 digits, each of which had to do with a different type
of information, e.g., shipment weight, revenue, serial number. The final
digit of the resulting sum was retained, and this constituted one digit in
the table of random numbers. The resulting table was then subjected to
various tests for randomness. In one test, for example, the frequency of
each digit's occurrence was listed, while in another test adjacent pairs of
digits were examined. Results of these tests indicated that the table did in
fact constitute a table of random numbers.


STRATIFIED PROBABILITY SAMPLES

Stratified probability samples are probability samples obtained by dividing
the population into strata and then selecting, for example, a simple random
sample from each stratum. Using the above example in which 10 students
are to be selected from a class of 94, let us suppose that they are to be
interviewed about their conceptions of the desirability of female candidates
for president. Presumably, male students would vary to some extent from
female students on these opinions. If the researcher desired a sample
of 10 which was genuinely representative of the class, he might do well
to make sure that males were neither overrepresented nor underrepresented
in his sample.

The actual sampling procedure might involve an assessment of
the ratio of males to females in the class as a whole. Let us assume that
this





discussed in Chapter 14, in the context of a coin-tossing situation. (For a
comprehensive discussion of statistical tests as well as measures of
association, see the standard textbooks on the subject.)

Using a probability sample, each event has a known probability of
occurrence. Thus, in the coin-tossing situation, we are able to calculate
the probability of any given number of heads and, consequently, are able to
test a hypothesis at a given level of significance.

While the magnitudes of α and β are clear in the case of a probability
sample, we are unsure about the probabilities of Type I and Type II error
for a non-probability sample. Granting that non-probability samples have an
important place in data-collection procedures, one question that arises has
to do with the appropriateness and the interpretation of statistical tests
on such samples. It might be argued that any procedure might be utilized if
it aids in the contexts of discovery or justification. With a
non-probability sample we would be unsure about α and β, or, more
accurately, we would be less sure about α and β than if we had a probability
sample. A statistical test, however, can provide a means for distinguishing
those hypotheses which seem to be more valid than others, and thus it might
be of use both in the context of discovery and in the context of
justification. Of course, we must bear in mind that the α which provides the
criteria for a given set of decision procedures through its determination of
the rejection region may be quite different from the α which is actually
correct. But this uncertainty constitutes only one of the many uncertainties
involved in the research process.

From a strict point of view, of course, any generalization to a larger
population on the basis of a non-probability sample is not justified,
because it is not possible to calculate the probability of Type I error when
the sample itself has no known probability of occurrence when compared with
other possible samples. It is possible to adhere to this point of view and,
at the same time, utilize the advantages of statistical tests within the
context of discovery. The approach that might be taken was discussed in
Section 14.4 and has to do with the use of statistical tests for exploratory
but not for testing purposes. Thus, the investigator would not use the
results of his calculations to decide that a given hypothesis was or was not
true of the larger population from which the sample was drawn. Nevertheless,
he could proceed to calculate the probability of rejecting the null
hypothesis when it is actually true (α). He would realize that the accuracy
of these calculations is based on the assumption of probability sampling,
but he nevertheless might use them to locate those hypotheses which seemed
to be worth following up in future studies.


The Binomial Test: An Example of a Statistical Test

Numerous statistical tests have been developed for a variety of the
decision-making purposes of researchers. The binomial test has been selected
for detailed illustration here because of its simplicity. In addition it has
already



ratio is 1 : 1, i.e., there are 47 males and 47 females. The researcher
might then choose his sample of 10 in such a way that this ratio is
retained, i.e., he might choose 5 males and 5 females. This might be
accomplished, for example, by assigning a number to each male and then
choosing 5 males with the aid of a table of random numbers. The same
process could then be repeated for the females. The result would be a
probability sample of the entire class which is stratified on the basis of
sex.
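The procedure of retaining the class ratio within the sample can be sketched as follows. The labels for the students, the function name, and the use of a pseudo-random generator in place of a table of random numbers are all assumptions made for illustration.

```python
import random

def proportional_stratified_sample(strata, sample_size, seed=0):
    """Draw a simple random sample from each stratum, with each stratum's
    share of the sample matching its share of the population."""
    rng = random.Random(seed)
    total = sum(len(members) for members in strata.values())
    sample = {}
    for name, members in strata.items():
        # Rounded quotas need not add up to sample_size in general;
        # they do here because the class divides evenly.
        quota = round(sample_size * len(members) / total)
        sample[name] = rng.sample(members, quota)
    return sample

males = [f"M{i:02d}" for i in range(1, 48)]     # 47 hypothetical male students
females = [f"F{i:02d}" for i in range(1, 48)]   # 47 hypothetical female students
sample = proportional_stratified_sample({"male": males, "female": females}, 10)
print(sample)
```

Replacing the proportional quota with fixed numbers per stratum (for example, seven and three) would give a disproportional stratified sample, which then requires weighting in the analysis.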

CLUSTERED PROBABILITY SAMPLES

Clustered probability samples are probability samples in which groupings
or clusters of the units to be sampled are first selected by means of a
probability sample. Let us use for illustrative purposes a situation in
which the political preferences of an electorate within a given municipality
are being investigated. One technique would be to utilize the census tracts
within the city as the clusters or groupings of individuals, and to first
take a simple random sample of census tracts. Within each of the census
tracts thus selected, the investigator might proceed to take a simple random
sample of the individuals residing there. A less expensive approach would be
to take a simple random sample of city blocks (constituting smaller
clusters), and finally to select every nth household (e.g., every third one)
within those blocks.

The advantage of clustered samples is the savings in money and time. The
investigator in this example saves greatly in interviewer travel because
all interviews are concentrated within a number of specific areas. The
survey can be done much more rapidly through this procedure than if the
respondents were scattered through all parts of the city. A disadvantage is
that there is a greater danger of overrepresenting or underrepresenting a
particular group than in the case of simple random sampling.


15.2 TESTS OF SIGNIFICANCE 


Tests of Significance on Non-Probability Samples

Statistical tests of significance refer to decision procedures for accepting
or rejecting a given null hypothesis. The principles of such procedures were

* This is an example of proportional stratified sampling. If the
investigator feels that more information is needed from one stratum than
from others, he may use disproportional stratified sampling techniques. For
example, suppose that the attitudes of the females were relatively
homogeneous while those of the males varied widely. In this situation the
investigator might select seven males and three females, and in the final
analysis weight each female's opinions more than that of each male so as to
secure a representative picture of the class as a whole.

* The last stage of this multistage sampling procedure, wherein each nth
unit is selected, illustrates systematic sampling techniques. With this
procedure we start with a random starting point among the first n units and
then take every nth unit. This technique is much more convenient than simple
random sampling. One disadvantage of it is that every set of elements does
not have the same probability of being chosen.


the financial aspects of a home for the aged are individuals who are more
likely to choose the homemaker rather than a home for the aged. Thus, a
one-sided test would be most appropriate in this situation. Our hypotheses
are:

H0: p(K) = .50
H1: p(K) > .50

The investigator must also determine the level of significance, i.e., he
must set α, or the probability of a Type I error, at a certain level, thus
determining the probability of Type II error (β). What is usually involved
is a balancing of the relative harm resulting from each type of error,
taking into account the size of the sample involved. According to
Bernoulli's theorem, the larger the sample, the closer its characteristics
to those of the population from which it was drawn. Thus, even if a given
null hypothesis is correct for a given population, the characteristics of a
small sample may depart a good deal from the general pattern. In other
words, the probability of making an incorrect decision tends to be greater
for a small sample than for a large one. Thus, if we set α at a low level
with a very small sample, β will be very high. If we hold α constant and
increase the sample size, β will be correspondingly reduced. In the present
example, we would do well to set α fairly high so as to reduce the very
large β that would otherwise result. Because α = .05 is conventional, we
might set α = .10. By adopting such a procedure, we reduce the probability
of Type II error, i.e., acceptance of null hypotheses when they are actually
false. Consequently, we reduce the probability of deciding that there is no
relationship between the two variables when one does actually exist. Thus,
while setting α high increases the probability of finding relationships that
actually exist, it also increases the probability of finding relationships
that actually do not exist.

Before proceeding with the calculations, some attention should be given to
the fact that we are dealing with a non-probability sample. Even if the
sample of 192 women were a probability sample drawn from all women in
Cambridge, the four women who happened to mention the financial aspects of
homes for the aged would not constitute a probability sample of any
particular population. Because we are using a statistical test where we have
no probability sample, it is well to remember that there is no population to
which we can legitimately generalize our results. Nevertheless, statistical
analyses may be performed in an exploratory vein.

We may now calculate the probability distribution for the number of
homemaker choices by four respondents being made on the assumption that
p(K) = .50. Table 15-1 is directly analogous to Table 14-2. Because the test
is a one-sided one, and in particular, H1 specifies p(K) > .50, the
rejection region includes only the event "4 homemaker choices," i.e., the
side of the distribution of homemaker choices which corresponds to H1. Fur-



been utilized in Chapter 14 within the context of the coin-tossing
situation. The aim here is to show how a statistical test is utilized within
a more realistic example. Although the example is more realistic than that
of coin tossing, the degree of realism is still limited.

Table 13-1 was based on a health study of a non-probability sample of 192
women from Cambridge. The data from the table deal with the relationship
between a choice of a homemaker or a home for the aged and the appearance of
various responses to an open-ended question as to the reasons for the
choice. We may proceed by setting up a null hypothesis to the effect that
there is no relationship between the choice and the various reasons for the
choice. Such a procedure constitutes a reductio ad absurdum, for the null
hypothesis is set up so that its consequences may be examined, and if the
probability is small that the event which actually did occur can occur under
this assumption of the null hypothesis, we shall decide to reject the null
hypothesis.

Let us examine one of the value expectations given as reasons for the
choice of a homemaker or a home. The data on this point are:


                                          Choice of
Value Expectations                 Homemaker    Home for Aged

Financial: negative on home
for aged ("Not enough money")          4              0


The number of individuals who were negative on the financial aspects of a
home for the aged, out of the original sample of 192, is only 4.
Nevertheless, it is still possible (although unusual) to conduct statistical
tests with samples of this size. On the assumption that there is no
relationship between choice of homemaker or home for the aged and whether or
not the respondent is negative on the financial aspects of a home for the
aged, the probability that any given respondent who mentioned this also
chose the homemaker should be equal to the probability that any given
respondent who mentioned this chose the home for the aged. Thus, the null
hypothesis is that the probability that a person who is negative on the
financial aspects of a home for the aged chose the homemaker, which may be
designated as p(K), is equal to .50.

We may next decide as to whether the test is to be one-sided or two-sided.
A two-sided test would be appropriate if we had no more reason to suspect
that p(K) is greater than .50 than to suspect that it is less than .50.
Actually, however, we have reason to suspect that p(K) is greater than .50,
because it is eminently reasonable to hypothesize that persons who are
dissatisfied with
is eminently reasonable to hypothesize that persons who are dissatisfied wi 

in which choices are conceived of as a product of expectations concerning
goals.

The foregoing illustration indicates one of many possible procedures for
performing a quantitative analysis on qualitative data. A given project
need not be categorized into either a non-statistical study or a statistical
one. Unfortunately, much research in the social sciences reflects a
commitment to one approach as well as a lack of appreciation for the utility
of the other approach. For example, participant-observation studies rarely
incorporate statistical analyses; the latter are, of course, not necessarily
inappropriate. The crucial question is whether any given method of data
collection or analysis is rejected because of its lack of utility, or
whether it is


15.3 MEASURES OF ASSOCIATION 

The behavioral scientist is interested in more than merely whether or not a
relationship exists between a given independent variable and a given
dependent variable. For example, knowledge of physics would not be greatly
advanced by the information that a relationship exists between the time a
body is in free fall and its velocity. It is important to know whether this
independent variable (amount of time in free fall) is the most important
factor in predicting distance fallen or if it is merely a relatively minor
factor among other more important ones. When the scientist is able to
determine which among many possible independent variables are most important
in explaining and predicting a given dependent variable, he has made a
considerable advance in his investigation of the nature of the relationships
involved. Thus, for example, he would be able to specify the relative
importance of age, sex, religion, social class, group affiliations,
aspirations, and many other factors in explaining the development of given
behavior patterns.

It might seem that degree of relationship can be assessed by a test of
significance. For example, could we not set α at a very small value, e.g.,
.001? Then would it not be true that, if the null hypothesis is so
decisively rejected, a close relationship exists between the two variables?
The answer is "No," because the size of the sample greatly affects the ease
of rejecting the null hypothesis at a given level of significance. Two
given variables might have the same degree of relationship in two different
samples, yet we might be able to reject the null hypothesis at α = .0001 if
the sample is very large while we might not even be able to reject the null
hypothesis at α = .10 if the sample is very small. There is no necessary
relationship between the degree to which two variables are interrelated and
the level of significance at which it is possible to reject the null
hypothesis.
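The effect of sample size on the ease of rejection can be illustrated numerically. The sketch below is an assumption-laden stand-in for the two-variable case: it uses a one-sided binomial test of p = .50 and treats "75 percent choosing one alternative" as the same degree of departure from the null hypothesis in a sample of 4 and in a sample of 400. The function name and the sample sizes are illustrative only.

```python
from math import comb

def upper_tail_probability(successes, n, p=0.5):
    """Probability of observing at least `successes` in n trials
    when the null hypothesis p is correct."""
    return sum(comb(n, k) * p**k * (1 - p)**(n - k)
               for k in range(successes, n + 1))

small_sample = upper_tail_probability(3, 4)       # 75 percent of 4 respondents
large_sample = upper_tail_probability(300, 400)   # 75 percent of 400 respondents

print(small_sample, large_sample)
```

With the small sample the null hypothesis cannot be rejected even at α = .10, while with the large sample the same proportion leads to rejection at levels far below .0001.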

It should be recognized that the finding of a high degree of relationship
between two variables does not automatically guarantee that the relationship



thermore, this is the only event for which α is less than or equal to the
limit of .10. If the events "3 homemaker choices" and "4 homemaker choices"
were both placed within the rejection region, then α would be equal to
.0625 + .2500, or .3125, which would greatly exceed the limit of .10.


TABLE 15-1

PROBABILITY DISTRIBUTION OF NUMBER OF HOMEMAKER
CHOICES FOR FOUR RESPONDENTS UNDER THE ASSUMPTION
THAT p(K) = .50

Number of Homemaker Choices      Probability

            0                    1/16 or .0625
            1                    4/16 or .2500
            2                    6/16 or .3750
            3                    4/16 or .2500
            4                    1/16 or .0625

          Total                 16/16 or 1.0000
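The construction of Table 15-1 and of the one-sided rejection region can be sketched in a few lines. The function names are illustrative, and the rule used here (add events from the upper end of the distribution for as long as α stays within the limit) is an assumption about how the region is built, consistent with the hypothesis that p(K) is greater than .50.

```python
from math import comb

def binomial_distribution(n, p):
    """Probabilities of 0, 1, ..., n homemaker choices among n respondents."""
    return [comb(n, k) * p**k * (1 - p)**(n - k) for k in range(n + 1)]

def upper_rejection_region(n, p, alpha_limit):
    """Largest upper-tail set of events whose total probability
    does not exceed alpha_limit, together with the resulting alpha."""
    probabilities = binomial_distribution(n, p)
    region, alpha = [], 0.0
    for k in range(n, -1, -1):
        if alpha + probabilities[k] > alpha_limit:
            break
        region.append(k)
        alpha += probabilities[k]
    return sorted(region), alpha

table = binomial_distribution(4, 0.5)
region, alpha = upper_rejection_region(4, 0.5, alpha_limit=0.10)
observed_homemaker_choices = 4
reject_null = observed_homemaker_choices in region
print(table, region, alpha, reject_null)
```

Raising `alpha_limit` to .3125 or more would bring "3 homemaker choices" into the region as well.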


In the actual data, where all four of the individuals who were negative on
the financial aspects of a home for the aged made the homemaker choice, the
number of homemaker choices falls within the rejection region. We are
thus able to reject the null hypothesis. What can we now do with this
result? Certainly we cannot state that a relationship between
dissatisfaction with the financial aspects of homes for the aged and the
homemaker choice exists for women in the city of Cambridge. But, after all,
such a statement does not seem particularly earth-shaking, nor does it (in
that form at least) seem to give promise for the advancement of science. If
the relationship is stated in its more abstract form, however, its
implications will become clearer. At a higher level of abstraction what has
been examined is the relationship between values and choices. In particular,
the finding provides an illustration of the general proposition that the
individual's choice is related to his expectations concerning the goals
involved.

Of what use is this finding? This depends, of course, on what is done with
it. In the context of justification we can do very little with it. Although
it bears on a proposition relating to choice behavior, the non-probability
nature of the sample permits us, from a strict point of view, to talk only
about the four individuals for which we have relevant data. Within the
context of discovery it provides the investigator with encouragement to
pursue a line of inquiry



is the degree of linear relationship between the two variables. The
computing formula for r is

$$r = \frac{N\sum XY - (\sum X)(\sum Y)}{\sqrt{[N\sum X^2 - (\sum X)^2][N\sum Y^2 - (\sum Y)^2]}}$$

Here, N corresponds to the number of instances; in the present illustration,
3 (students). In addition to N, there are five quantities needed in order to
calculate r: ΣX, ΣY, ΣXY, ΣX², and ΣY². It should be noted that ΣXY
is calculated by summing each of the products of X and Y. Thus, ΣXY =
(1 × 3) + (3 × 1) + (2 × 2) = 3 + 3 + 4 = 10. Also, ΣX² and ΣY² are
calculated by summing the squares of the variable in question. Thus,
ΣX² = 1² + 3² + 2² = 1 + 9 + 4 = 14. All the calculations for these five
quantities are presented in Table 15-2.

TABLE 15-2

CALCULATION OF r, THE PRODUCT-MOMENT
CORRELATION COEFFICIENT

              X       Y       XY      X²      Y²

Student A     1       3       3       1       9
Student B     3       1       3       9       1
Student C     2       2       4       4       4

          ΣX = 6   ΣY = 6   ΣXY = 10   ΣX² = 14   ΣY² = 14

$$r = \frac{3(10) - (6)(6)}{\sqrt{[3(14) - 6^2][3(14) - 6^2]}} = \frac{30 - 36}{\sqrt{(42 - 36)(42 - 36)}} = \frac{-6}{6} = -1$$

Thus, r = -1, indicating a maximum degree of relationship between
the two variables, although in opposite directions. Thus, among these three
medical students, a high degree of interest in research is associated with a
low degree of interest in close relationships with patients, and vice versa.
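The computing formula lends itself to a direct check. The following sketch applies it to the scores of the three students; the function name `pearson_r` is illustrative, and no provision is made for the case in which one of the variables does not vary at all (the denominator would then be zero).

```python
from math import sqrt

def pearson_r(xs, ys):
    """Product-moment correlation coefficient from the computing formula
    based on N and the five sums described in the text."""
    n = len(xs)
    sum_x, sum_y = sum(xs), sum(ys)
    sum_xy = sum(x * y for x, y in zip(xs, ys))
    sum_x2 = sum(x * x for x in xs)
    sum_y2 = sum(y * y for y in ys)
    numerator = n * sum_xy - sum_x * sum_y
    denominator = sqrt((n * sum_x2 - sum_x**2) * (n * sum_y2 - sum_y**2))
    return numerator / denominator

interest_in_patients = [1, 3, 2]   # Variable X for Students A, B, C
interest_in_research = [3, 1, 2]   # Variable Y for Students A, B, C
r = pearson_r(interest_in_patients, interest_in_research)
print(r)
```

Supplying two lists that rise and fall together would instead give a positive value of r.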

We might also calculate r for the following situation, where we have
the same variables as in the previous illustration, but three different
students are involved:





is therefore an important one. For example, A and B may be highly
interrelated because some third variable C causes both. In such a situation,
A does not cause B nor does B cause A, and this is an illustration of the
well-known saying that "correlation is not causation."

One of the simplest measures of association is the percentage difference.

Table 13-3 showed that the percentage of students m the 

is substantially higher among th^ am ll„, staw interest. By sub- 

assgnment than among those students who . . jjjnong 

traSing the smaller percentage <*' ® eentage with B- 

those now showing mtercst) from “"i.'”6“ “ ' * yetoobtainameas- 

or higher among those now showing no inlcKrt L . In this example, we 

ure of the degree of relationship between the “» te compared 

have 58 percent - 25 percent = 33 petee^ Tb s g^ J 
with others such as that in Table 13-6 There the g 
background was related to grade, with 41 percent of ij science 

courses obtaining D- or higher and 38 percent ot lbos without » 
courses obtaining B- or highct. The percentage dilTere h , ,i„„ship 
- 38 percent = 3 percent. In this way we 'an thw tat 
seems to be much closer in Table 13-3 than in Table die 

Perhaps the most widely used measure of association is the Pearsonian
product-moment correlation coefficient r (usually simply designated the
correlation coefficient). We may illustrate it with data from the medical
student study. Let us designate the degree of interest in close relationships
with patients as Variable X; also, let us designate degree of interest in
research as Variable Y. The scores for three of the medical students in the
study were:


             X     Y
Student A    1     3
Student B    3     1
Student C    2     2

The correlation coefficient provides a measure of the degree to which
two variables vary together, where the relationship between the two is
conceived of as a directly proportional or linear one. If they vary in opposite
directions in a perfectly consistent linear fashion, as in the above illustration,
r = -1. If they vary together in the same direction in a perfectly consistent
linear fashion, r = 1. If there is no linear relationship between them,
r = 0. Thus, r can take any value between -1 and +1, inclusive. The closer
it is to +1, the greater the degree of linear relationship between the two
variables in the same direction. The closer it is to -1, the greater the degree
of linear relationship in the opposite direction. The closer it is to 0, the
smaller the degree of linear relationship.




For this illustration, it is not necessary to evaluate the denominator, because
the numerator is equal to 0: $N\Sigma XY - (\Sigma X)(\Sigma Y) = (3)(6) - 18 = 18 - 18 = 0$.
Thus, r = 0, and there is no relationship among these three students between an
interest in research and an interest in close patient relationships.

In Chapter 11 correlations were discussed in connection with factor analysis, a
technique which is applied to correlation coefficients for purposes of concept
formation and measurement. They are more generally used in the analysis of
cause-and-effect relationships where directly proportional relationships are
involved. In the medical student study, for example, the correlation coefficient
can provide a measure of the degree of relationship between expected value [...]
and [...] favorable rating of a given specialty. If a relationship exists between
any two variables (e.g., age and income), the correlation coefficient can be
used to evaluate how close the relationship is.

The correlation coefficient, a measure of degree of relationship, also has
implications for the prediction of one variable by means of the other, and
there the type of relationship is [important]. The closer the correlation
coefficient is to 1 or -1, the more accurately we can predict one variable by
means of the other. The type of relationship used in this prediction is a linear
one, or one of direct proportionality. As the correlation coefficient approaches
0, the accuracy of any linear prediction of one variable by means of the other
decreases.

It should be noted that the relationship between two variables may not be a
linear one. The relationship may not be one of direct proportionality, as in
the relationship between s and t in $s = \tfrac{1}{2} g t^2$. Here, s varies with
the square of t and is not directly proportional to t; this is a curvilinear
rather than a linear relationship. In these situations, non-linear measures for
assessing degree of relationship, as well as non-linear mathematical functions
for purposes of prediction, are called for.
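The point about curvilinear relationships can be made concrete with a short calculation. The sketch below is an illustration only: the value of g, the five time points, and all names are assumptions introduced here, not data from the text. It computes r between s and t, and then between s and the square of t, for values generated exactly from s = 1/2 g t^2.

```python
import math

def pearson_r(xs, ys):
    """Product-moment correlation computed from deviations about the means."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)

G = 9.8  # assumed value of g, chosen only for illustration
times = [0.0, 1.0, 2.0, 3.0, 4.0]              # hypothetical values of t
distances = [0.5 * G * t ** 2 for t in times]  # s = 1/2 g t^2, with no error at all

# s against t: the relationship is exact but not linear, so r falls short of 1.
r_linear = pearson_r(times, distances)

# s against t squared: the relationship is directly proportional, so r is 1
# (up to floating-point rounding).
r_squared_time = pearson_r([t ** 2 for t in times], distances)

print(round(r_linear, 3), round(r_squared_time, 3))
```

Even though s is completely determined by t, r understates the closeness of the relationship until the predictor is transformed, which is one simple way of supplying a non-linear function for purposes of prediction.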






Multivariate Analysis 


In his analysis of data the general aim of the researcher is to provide the
kind of evidence which can lead to an understanding of processes. This
chapter begins with a discussion of some of the complexities involved in causal
processes and goes on to consider the way in which theory can contribute to
an understanding of cause-and-effect relationships. Attention is then given
to the analysis of relationships among three variables (one-variable and
two-variable analysis techniques were discussed in Chapter 13). Given a
situation in which there is a relationship between two variables, several
different things can occur when the investigator controls on a third variable;
a number of examples are used to illustrate these various possibilities.

16.1 THE CAUSAL PROCESS

It was once generally thought that for every effect there is one and
only one cause and that if several causes are discovered then the effects must
really be more than one. For example, Emile Durkheim found what he
decided were three causes of suicide.* He therefore concluded that suicide is
not a unitary concept, and that actually there are three different kinds of
suicide, i.e., three different effects. The contemporary approach is one of
allowing for and expecting a number of different causes for a single effect.
Physicians, for example, distinguish between predisposing causes and precipitating
causes of death. This assumes not only multiple causation but also sequential
occurrence (to some extent); one set of causes is predisposing and
occurs prior to another set of causes, which is precipitating.

* Emile Durkheim, The Rules of Sociological Method (translated by Sarah A. Solovay
and John H. Mueller), edited by George E. G. Catlin (New York: The Free Press of
Glencoe, 1950).

A Physical Science Example

The complexity of the causal process may be illustrated by the
answers given to a child's query as to why, for example, the lamp turns on
(Event Z) in his room when the wall switch is flipped (Event F). When the
child is very young, we might simply say that a wire (Event G) connects
the wall switch to the lamp. When he is older we might introduce the idea of
an electric current (Event H), which [heats] a bit of metal inside the light
bulb (Event I), and that some objects (such as this piece of metal) become
very bright when [heated]. A more thorough explanation would refer to [...]
the situation. For example, we might [point out that a] difference in
electrical potential exists between the two ends of the [wire leading] to the
energy source (Event K) and that flipping the switch completes the circuit
(Event L) and thereby allows electrons to move freely from the higher potential
level to the lower one (Event M).

We might diagram the relationships by letting an arrow between two events
designate a cause-and-effect relationship. Thus, A -> B signifies that Event A
causes Event B. Also, we may diagram a situation in which Events A and B jointly
operate to cause C.



The following diagram summarizes this discussion:

[Diagram not reproduced. Legible labels: "Electrons allowed to move from high
to low potential levels"; "Electric current flowing in wire."]

Note that three additional events have been added: N, O, and P. We might
examine these three events together with Events K and G, comparing them to
the other events, all of which are listed along the same line. The distinction
is not in terms of relative importance, for the omission of any one of the ten
events leading to Z would prevent the lamp from turning on. Also, multiple
causation exists not only in the process as a whole, but also in certain portions
of the process. For example, Events F, G, and K all contribute to Event L
(the completion of the circuit). In order to complete the circuit, the switch
must be flipped, a wire must connect the switch to the lamp, and both ends of
the wire must be attached to an energy source. The distinction between Events
G, K, N, O, P and Events F, L, M, H, I, and Z seems to be that the former
represent the initial characteristics of the situation and the static characteristics
at a given point in time, while the latter seem to be closely related to dynamic
changes in the situation.

Physical and Behavioral Science Combined

The causal process does not begin with Event F and end with Event Z. The
following provides an illustration of what might occur prior to F.


[Diagram not reproduced. Legible labels: "Sun moves below horizon";
"Individual can no longer see clearly"; "Goal of straightening up room";
"Expectation that goal can be achieved with more light"; "Expectation that
flipping switch will provide more light."]

This example of the reasons why a given individual might flip the switch
provides an explanation that ends where the previous analysis begins and thus
covers a wider span of time. In this manner, the comprehensiveness of the
causal analysis is increased. One might, of course, continue with the
causal process by examining the impact of turning on the lamp on subsequent
events, thus increasing the comprehensiveness of the total explanation. It
should be noted that the total explanation includes aspects of physical and
behavioral science, perhaps indicating a certain unity of science.

In addition to evaluating the [explanation] with respect to comprehensiveness
one [may evaluate it with respect to the degree] of detail provided. For
example, the [explanation might specify many more of the] events which occur
between flipping the switch (F) and the lamp turning on (Z), though only a
split second actually elapses between F and Z. It [is also possible to]
distinguish many more initial conditions, e.g., [that the bit of metal is a]
substance that becomes very bright when [heated, and that the] wires are well
insulated from one another. It is also possible to specify additional changes in
the situation which lead to the lamp turning on [...] from
the source of energy to the wire in the [...]. [...]
adding both comprehensiveness and detail, and this constitutes a never-ending
aspect of the scientific process.

This attempt to explain certain causal processes involved fundamental
theory both in physics and behavioral science. [The function of] theory in this
context is to select, from among the myriad initial conditions and dynamic
factors, those most directly relevant to the [phenomenon under] investigation.
With respect to the behavioral science part of the explanation one could
conceivably refer to the many different experiences [which occurred] throughout
the life of the individual and which led to his flipping the switch in this
particular situation. A given theory of [behavior might specify] that it will be
enough to focus on a few goals and a few expectations in the present situation.

A Behavioral Science Example

Another example of a causal sequence may be diagrammed as follows:

[Diagram not reproduced. The one fully legible label reads: "Child wants to stay
in mother's good graces and also realizes that he will be punished if he doesn't
put away his toys." The remaining labels are illegible in the source.]


The theory involved here is that one goal is developed on the basis of other
goals. The other goals in this situation are the desire to stay in mother's
good graces and the desire to avoid punishment. In this way the child
learns that the goal of putting away one's things is a desirable one to have. The
theory here is simply an extension of that involved in [the medical student]
study; in this case, choice among alternatives leads to the development of [...]


Theory and Causation

Thus far in our discussion of the causal process we have [referred to causes]
and effects as if we had a special kind of insight into the [nature of reality].
The actual situation in which the scientist finds himself [involves many]
variables and a complex network of interrelationships. [Some of]
the relationships have to do with only [two variables, while others] are
concerned with a larger number. The relationships [vary in degree of] closeness
as well as in kind. If the above examples of causal processes seemed somewhat
complex, when we deal with merely statistical relationships among variables
the complexity is compounded many fold.

Theory, which enabled us to find a path through the complexities of the
causal process, is also our best tool for [finding a path] through the maze of
relationships among variables. Earlier sections of this volume outlined the
role of theory in the definition of a research problem, in the collection of data,
and in measurement; its role in the analysis of data is equally important.

In Chapter 13, "One and Two Variables," [attention] was called to the
two-variable situation and hypotheses [...]. The emphasis was on testing a
particular hypothesis more or less in [isolation from] other hypotheses. But the
history of science seems to indicate that advances are made when the scientist
attempts to develop and test systems of hypotheses, or theories. We could
[conceive of the scientific] process as a vast search for tiny pieces of a very
complicated puzzle in the situation where many of the objects found are not
pieces of the puzzle but merely look like them. The finding of such a deceptive
piece would correspond to a Type I error (rejection of a true null hypothesis
and, thus, acceptance of the alternative hypothesis), while the failure to find a
piece of the puzzle where it actually was buried corresponds to a Type II error
(accepting a false null hypothesis that, e.g., no relationship exists between two
variables). It would seem that one of the best ways of proceeding in such a
situation is to try to put the pieces of the puzzle together and to make use of
those sets of pieces that have already been put together successfully. An analogy
might be drawn between trying to put together pieces and testing theory within the
context of justification. Also, we might draw an analogy between using sets of
put-together pieces to find other pieces and the development of theory within the
context of discovery. Theory would be the best tool for determining
whether any isolated piece is a deceptive piece or not by testing whether or
not the isolated piece fits into the [puzzle].

The importance of integrated sets of hypotheses for the analysis
of data is commented on by Camilleri:

[The quotation is largely illegible in the source. Legible fragments: "In the
development of [...]"; "hypothesis as a theorem"; "the deductive system"; "It is
this conceptual relevance that [...] the systematic import [...] an empirical
result."]

Camilleri emphasizes the importance of [systematic import] as well
as the limitations of testing a given [hypothesis apart from a deductive]
system (i.e., from theory). In this [connection he] discusses the limitations of
statistical tests of hypotheses, [which have] been primarily [developed]
for testing a given hypothesis in isolation from others. Very little [in the way]
of statistical techniques seems to have been produced for the testing of
[systems of hypotheses].

16.2 THREE VARIABLES: AN ILLUSTRATION OF A SPURIOUS RELATIONSHIP

We now shift our attention to the [way in which the] researcher [makes]
decisions about the nature of the causal process on the basis of
interrelationships among variables. We shall deal primarily with the
three-variable case, for the complexities of the analysis of [more]
variables are beyond the scope of this book. When making decisions
on the basis of relationships among variables, we are [acting] in the
face of uncertainty. [...] The process of statistical decision making is [...];
the statistician attempts to develop strategies for [...]. [We are interested in
evidence as to whether] variables stand in a cause-and-effect relationship to one
another.

One important guide for obtaining such evidence is Mill's method of difference,
which deals with two situations that differ in one respect and have every
other circumstance in common but one (the second respect in which the
situations differ). In this event, according to Mill's method, we may regard
one of the respects in which the situations differ as the cause, and the other
as the effect. Of course, the present-day research framework that most scientists
have adopted no longer permits the statement with certainty that a
given cause-and-effect relationship exists. But the method of difference can
nevertheless be used to provide evidence for or against a cause-and-effect
relationship even if the evidence is not viewed as adding up to [proof].

A Two-Variable Relationship

To illustrate the procedure, let us assume that the effect we [wish to explain]
is the amount of property damage caused by any given fire [...]
within the United States. We shall designate the variable amount of damage
as y, the dependent variable. Let us now consider, as one of the possible causes
of the amount of damage, the number of fire engines present at the fire. Let
us designate this as variable x. Of course, this is a [ridiculous hypothesis] in
terms of our knowledge of such things, but it will be to our advantage to set
up a situation in which we know no actual causal relationship exists. If we
know what the nature of reality is, we can better [evaluate procedures]
designed to make correct decisions about it on the basis of [data]. Table 16-1
presents hypothetical data on the relationship between x and y.


TABLE 16-1
[xy]: THE RELATIONSHIP BETWEEN NUMBER OF FIRE ENGINES (x)
AND AMOUNT OF DAMAGE (y)

                       Number of Fire Engines
Amount of Damage       0 or 1 (%)     2 or more (%)
$10,000 or more            30              59
Under $10,000              70              41
Total                     100             100
                        (1500)           (700)

The data indicate that the percentage of fires with damage of
$10,000 or more among those fires at which two or more fire engines were
present is substantially greater than that among those fires at which one or
no fire engine was present.



A Statistical Test

A statistical test on the data will [indicate] whether or not the relationship
between x and y is statistically significant. The binomial test (discussed
in Chapter 15) does not apply to a situation in which two variables are
cross-tabulated or interrelated as [in Table 16-1. A t-test for the]
significance of differences [in proportions may be employed].*

We might specify the null and alternative hypotheses as follows:

$H_0: P_1 = P_2$
$H_1: P_1 > P_2$

A one-sided test is indicated in this [situation because we expect]
that the number of fire engines would be [greater where the]
amount of damage is great. $P_1$ and $P_2$ refer to the proportions of fires where
the amount of damage is $10,000 or more in the populations from which the
sample was drawn (where two or more fire engines and one or none, respectively,
are involved). We might conceive of the population as consisting of
a large number of fires from which the sample was [selected by] probability
sampling. It remains now to specify the level of significance. Let us
here set $\alpha = .05$. Thus, if $H_0$ is rejected at this level, the probability
is [...] that a difference in the appropriate direction does not exist in the
population.

Calculations indicate that the sample difference between $p_1$ and $p_2$ is
in fact statistically significant at the .05 level. Thus, we reject the null
hypothesis that $P_1$ equals $P_2$ and accept the alternative hypothesis. In
other words, we find the relationship between the number of fire engines and
the amount of damage statistically significant at the .05 level.

* The t test makes use of a sample statistic already discussed in the [context]
of association: the difference between the [two proportions]. From Table 16-1 it
can be calculated that $p_1 - p_2 = .59 - .30 = .29$. In Table 16-1, $p_1$ refers
to the proportion of fires with damages of $10,000 or more among those fires where
two or more fire engines were present, while $p_2$ refers to the proportion of
fires with damage over $10,000 among those fires where one or no fire engine was
present. In this test, we assume that an infinite number of samples of size
700 + 1500 is drawn from the same population. If a suitable transformation of the
[sampling distribution of] $(p_1 - p_2)$ is made, we can assume that this sampling
distribution will be [...]. By setting up a level of significance and a region of
rejection, we will then be able to decide whether to accept or reject the null
hypothesis.

The formula for this transformation, which is the basis for the t-test for the
significance of a difference in proportions, is: [formula not legible in source]

Here $p_1$ and $p_2$ refer to the two sample proportions, e.g., .59 and .30, and
[$N_1$ and $N_2$ refer] to the column totals on the basis of which the proportions
are calculated, e.g., 700 and 1500. Tables containing the calculated values of
the transformation are available for the convenience of the researcher. See
Vernon Davies, "A Rapid Method for Determining the Significance of the Difference
Between Two Percentages," Stations [...], Washington Agricultural Experiment
Station, Washington State University, Pullman, Wash.
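Because the transformation itself cannot be read in the source, the following sketch substitutes the standard pooled large-sample z statistic for a difference between two proportions. This is an assumption made for illustration and is not necessarily the transformation the author or Davies used; the function names are likewise hypothetical. It is applied to the proportions and column totals cited for Table 16-1.

```python
import math

def two_proportion_z(p1, n1, p2, n2):
    """Pooled large-sample z statistic for H0: P1 = P2 (a stand-in, see lead-in)."""
    pooled = (p1 * n1 + p2 * n2) / (n1 + n2)
    std_error = math.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    return (p1 - p2) / std_error

def upper_tail_p(z):
    """One-sided probability P(Z >= z) for a standard normal variable."""
    return 0.5 * math.erfc(z / math.sqrt(2))

# p1 = .59 among the 700 fires with two or more engines;
# p2 = .30 among the 1500 fires with one or no engine.
z = two_proportion_z(0.59, 700, 0.30, 1500)
p_value = upper_tail_p(z)

ALPHA = 0.05  # level of significance chosen in the text
reject_null = p_value < ALPHA
print(round(z, 2), reject_null)
```

With samples this large the statistic lies far beyond the one-sided critical value, which agrees with the conclusion that the null hypothesis is rejected at the .05 level.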

This example provides another illustration [of the fact that statistical]
relationships and cause-and-effect relationships may [be quite] different things.
In this case, because of the nature of the data, it is obvious that no
cause-and-effect relationship actually exists, although [there is a statistically]
significant relationship. But the investigator is usually [not so] fortunate as to
know that a given independent variable could not possibly [cause a given]
dependent variable. Are there any further tests [which might] provide additional
evidence as to whether or not the relationship is [a causal one]?

Following Mill's method of difference, we [proceed by] comparing two
groupings of fires, those with one or no [fire engine and those with] two or
more, with respect to the amount of damage [after the] groupings have been
equated on certain other factors. Mill's method [calls for] equating the
two groupings in all [respects but one]. If after the groupings
have been equated on a number of factors, [we still] observe a relationship
between the number of fire engines and the amount of damage, then this may
be taken as evidence for a causal [relationship] between these two variables.
The rationale here is the same as that utilized in the before-and-after
experiment with one control group. The experimental group and the control group
are subjected to different treatments, but are equated insofar as possible in all
other respects. If a difference then emerges between the [two] groups on the
"after" test, we attribute this to the difference in treatments (the independent
variable). On the other hand, if no difference [emerges on the] "after" test, we
have evidence to the effect that no causal [relationship exists] between the
independent variable and the dependent variable.


Controlling on a Third Variable

The way in which we will [equate the two] groupings of fires to some [extent]
will illustrate non-experimental [analysis] in general and survey
analysis in particular. The analysis [of data within the] context of the
experiment has already been taken up in [...]. A widely used technique of survey
analysis is to set up a series of [three-variable] tables, with a different third
variable used to equate the two original groupings in each of the tables.
[Such] third variables are called [control variables]. The variable introduced in
Table 16-2 is the type of fire engine [at] the fire, i.e., new or old ones.
(Let us assume for purposes of [illustration] that no combinations of both
old and new fire engines appeared at [any one fire].)

Table 16-2 is just like two [fourfold tables put] together. A one-sided
t-test for the significance of [the difference between] two proportions indicates
that both are significant at the .05 level. [Table 16-2] illustrates a situation
in which the control variable is irrelevant [to the relationship] between the
independent variable and the dependent variable. (We might introduce hundreds
of other irrelevant variables and [come up with] the same statistically
significant relationship between [number of fire engines] and amount of damage.)
It is typical of situations in which the test variable (z) does not
result in a change in the initial relationship between the independent variable
(x) and the dependent variable (y). As can readily be seen [from Table 16-2],
the relationship between number of fire engines and amount of damage is the
same regardless of whether new or old fire engines are involved.

TABLE 16-2
[xy; z]: THE RELATIONSHIP BETWEEN NUMBER OF FIRE ENGINES (x) AND
AMOUNT OF DAMAGE (y), CONTROLLING ON TYPE OF FIRE ENGINE (z)

[Table not reproduced. Column groups: New Fire Engines (0-1 fire engines, 2 or
more fire engines) and Old Fire Engines (0-1 fire engines, 2 or more fire
engines). Rows: $10,000 or more, Under $10,000, Total. The cell entries are
illegible in the source.]

Having illustrated a situation in which the control variable is unrelated
to either the independent variable or the dependent variable, let us now
illustrate a situation in which it is related to both. Let us assume that an
analysis was made the aim of which was to relate the number of alarms (t)
sent out after each fire was discovered by the fire department to variables
x and y (number of fire engines and amount of damage, respectively). Tables
16-3 and 16-4 present the relationships between t and x and between t and
y, respectively.

Table 16-3 shows the relationship between the number of alarms (t) and
the number of fire engines (x), and this relationship is statistically significant
at the .05 level. A statistically significant relationship at the .05 level is also
found for the data in Table 16-4, where number of alarms (t) is related to
amount of damage (y). If a relationship between two variables is indicated
by a line of dashes, then the following diagram summarizes the relationships
among x, t, and y:

[Diagram not reproduced.]



TABLE 16-3
[tx]: THE RELATIONSHIP BETWEEN NUMBER OF ALARMS (t) AND NUMBER OF
FIRE ENGINES (x)

[Table not reproduced. Columns: 0-1-Alarm Fire, 2-3-Alarm Fire. Rows: number of
fire engines. The cell entries are illegible in the source.]

TABLE 16-4
[ty]: THE RELATIONSHIP BETWEEN NUMBER OF ALARMS (t) AND AMOUNT OF
DAMAGE (y)

[Table not reproduced. Columns: 0-1-Alarm Fire, 2-3-Alarm Fire. Rows: $10,000 or
more, Under $10,000. The cell entries are illegible in the source.]

It remains to investigate the [relationships among x, y, and t]
when taken together. In particular it would be [important] to determine whether
controlling on Variable t alters the relationship between x and y. The
relationship between x and y remained the same when we controlled on z, a variable
which was related neither to x nor to y, and consequently additional support
was found for a causal relationship between x and y. If in [controlling on t]
we have a disappearance of the [relationship, evidence will]
be presented against the existence of such a causal relationship according to
the method of difference. This is because, when [we compare the 0-1 engine fire]
with the 2-or-more engine fire with respect to [fires with the same number of]
alarms, no corresponding difference with respect to [amount of damage appears].
Table 16-5 presents the relationship between number of fire engines (x) and
amount of damage (y) with a control on the number of alarms (t).


TABLE 16-5
[xy; t]: THE RELATIONSHIP BETWEEN NUMBER OF FIRE ENGINES (x) AND
AMOUNT OF DAMAGE (y), CONTROLLING ON NUMBER OF ALARMS (t)

[Table not reproduced. Column groups: 0-1-alarm fire (0-1 fire engines, 2 or more
fire engines) and 2-3-alarm fire (0-1 fire engines, 2 or more fire engines).
Rows: $10,000 or more, Under $10,000, Total. The cell entries are illegible in
the source.]


Table 16-5 may be viewed as a combination of two fourfold tables, as with
Table 16-2. Each of the two fourfold tables in Table 16-5 deals with the
relationship between the number of fire engines and the amount of damage. The
difference between them is that in the left-hand table only 0-1-alarm fires are
involved, while in the right-hand table only 2-3-alarm fires are included. Thus,
the number of alarms is controlled in each of the two fourfold tables. The
result is that the relationship we found earlier between the number of fire
engines and the amount of damage completely disappears.
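The disappearance of a relationship under control can be traced step by step. The sketch below is an illustration, not a reproduction of the book's table: the cell counts are hypothetical, chosen here only so that the marginal figures agree with the column totals (1500 and 700) and the percentages (30 and about 59) cited for the two-variable table. All names are assumptions.

```python
# (alarm level, engine level) -> (number of fires, fires with damage of $10,000 or more)
# Hypothetical counts; see the lead-in for how they were chosen.
cells = {
    ("0-1 alarm", "0-1 engines"): (1000, 50),
    ("0-1 alarm", "2+ engines"): (200, 10),
    ("2-3 alarm", "0-1 engines"): (500, 400),
    ("2-3 alarm", "2+ engines"): (500, 400),
}

def percent_high_damage(engines, alarm=None):
    """Percent of fires with high damage, optionally within one alarm level."""
    fires = 0
    high = 0
    for (cell_alarm, cell_engines), (n, n_high) in cells.items():
        if cell_engines == engines and alarm in (None, cell_alarm):
            fires += n
            high += n_high
    return 100.0 * high / fires

# Two-variable view: the percentage difference ignoring the number of alarms.
overall_diff = percent_high_damage("2+ engines") - percent_high_damage("0-1 engines")

# Three-variable view: the same difference within each alarm level.
controlled_diffs = {
    alarm: percent_high_damage("2+ engines", alarm)
    - percent_high_damage("0-1 engines", alarm)
    for alarm in ("0-1 alarm", "2-3 alarm")
}

print(round(overall_diff, 1), controlled_diffs)
```

In these assumed counts the difference is about 29 percentage points before the control and zero within each alarm level, which is the pattern described above: the fires with many engines are mostly the fires with many alarms, and it is the latter grouping that carries the difference in damage.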

In accordance with Mill's method of difference, evidence has here been
presented against the existence of a cause-and-effect relationship between the
number of fire engines and the amount of damage. Of course, we knew this
all along, but most realistic research situations will not be so obvious. We
might now attempt to diagram the causal relationships we suspect might exist
among x, y, and t. Solid lines with arrowheads will be used to indicate a
suspected causal relationship.


[Diagram not reproduced.]

There is no causal arrow between x and y because the statistical relationship
between these variables disappears when we control on t. Of course, there is
no assurance that there is a causal [relationship between the number of alarms]
and the amount of damage. [Some] other variable m might cause both the number of
alarms and the amount of damage:

[Diagram not reproduced.]

As a matter of fact, this is the actual situation, [for it is difficult to accept]
the idea that the number of alarms (t) [causes the] degree of damage
(y). If we conceive of m as the initial seriousness (intensity, spread, and type)
of the fire, then a serious fire will [lead to both a large] number of alarms and
[a high] degree of damage. Thus, any given causal diagram may become quite complex.
There is always the possibility that an additional variable causes a
suspected cause or a suspected effect. In this situation, illustrated by m, t,
and y, we designate the initial suspected [causal] relationship as a spurious
relationship. Another illustration of a spurious relationship is that between the
number of fire engines (x) and [the amount of damage (y)]. Although x [and y are
statistically related, there is no] causal relationship between x and y [...]
number of alarms (t), and the number of [fire engines (x). In this]
kind of interrelationship among three [variables we] designate the number
of alarms (t) as the intervening variable between event m
(initial seriousness of the fire) and [event x (number of] fire engines). As we
saw in Section 16.1, one characteristic of the scientific process is the search
for variables which intervene between [other] variables. Thus, with
respect to the electric-light example, a [number of] variables intervened
between F (flipping the wall switch) and Z (the lamp turning on).

We might question the direction of the arrows which have been drawn in
the diagrams shown. The arrows were drawn between two events from the one
occurring earlier in time to the one [occurring later. Our] general knowledge
tells us that m (initial seriousness of the fire) [would precede y (degree of
damage)], and that m would precede t (number of alarms), [which] would, in turn,
[precede] x (number of fire engines).

Although it is possible to take into account the sequence of events on the
basis of general knowledge, nothing in [the data indicates] which event actually
comes first. In Table 16-5, for example, we present the relationship between
the number of fire engines (x) and the [amount of damage (y)], controlling
on the number of alarms (t). Part of our interpretation is that t
causes x, and that x does not cause y, because both x and y result from some
other factor. These conclusions are not based just on the data presented in the
tables, but also on our general knowledge about the time [order in which]
events occur. As a matter of fact, we would come to completely different
conclusions on the basis of similar data if we had other ideas [about the time
order].

16.3 THREE VARIABLES: AN ILLUSTRATION OF AN 
INTERVENING VARIABLE 

This point might be illustrated by reference to other variables. Let
us designate x as race, y as IQ score, and t as degree of education. We will
now present in Tables 16-6, 16-7, and 16-8 the same data as we presented
in Tables 16-1, 16-3, and 16-4. The difference is that we shall be thinking in
terms of variables for which a different time order exists.

TABLE 16-6
[xy]: THE RELATIONSHIP BETWEEN RACE (x) AND
IQ SCORE (y) IN AN AMERICAN CITY

[Table not reproduced. The cell entries are illegible in the source.]

TABLE 16-7
[tx]: THE RELATIONSHIP BETWEEN DEGREE OF EDUCATION (t)
AND RACE (x)

             Not a High School     High School Graduate
Race         Graduate (%)          or More (%)
White        [illegible]               50
Negro        [illegible]               50
Total           100                   100
              (1200)                (1000)

TABLE 16-8
[Title illegible in the source; the table relates degree of education (t) to
IQ score (y).]

               Not a High School     High School Graduate
IQ Score       Graduate (%)          or More (%)
100 or more        [5]                   [80]
Under 100           95                    20
Total              100                   100
                 (1200)                (1000)

(Bracketed entries are not legible in the source and are obtained by subtraction
from the column totals of 100 percent.)

There is no need to carry through the statistical analysis of these tables, for
the same data were analyzed for Tables 16-1, 16-3, and 16-4. Here as in those
tables we have the situation where x is related to y, t is related to x, and t is
related to y. Diagrammatically it might be summarized in the same way as
previously:

[Diagram not reproduced.]



/ \ ,^».<»ree of education 

We have found that race (x) is related to IQ score (y), degree of education (t) is related to race (x), and degree of education (t) is related to IQ score (y). It remains now to examine a table relating all three variables to one another to shed additional light on the causal process involved. Table 16-9 includes the same numerical data as is contained in Table 16-5.


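TABLE 16-9
[xy;t]: THE RELATIONSHIP BETWEEN RACE (x) AND IQ SCORE (y) CONTROLLING ON DEGREE OF EDUCATION (t)
(Table body illegible in this copy; the data are those of Table 16-5. Columns: Not a High School Graduate, High School Graduate or More, each divided into White and Negro. Rows: IQ score 100 or more, under 100, total.)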

As with Table 16-5, there is no relationship between x and y when we control on t. The causal process might be symbolized as follows:

x → t → y

In words, race (x) affects opportunities for education, and degree of education (t) affects IQ (y). Thus, poorly educated whites and Negroes both tend to have low IQ scores, and highly educated whites and Negroes do equally well in obtaining high IQ scores. The reason that Negroes tended to have lower IQ scores than whites in Table 16-6 is that among Negroes a much smaller proportion had had a formal education than among whites. When we control on education we see that, when Negroes are equated with whites in terms of formal education, they do just as well on their IQ scores.
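The logic of controlling on an intervening variable can be sketched with a short calculation. This is only an illustration under stated assumptions: the counts are hypothetical (they are not the values of Tables 16-6 through 16-9), and the category labels `x1`/`x2` and `t1`/`t2` are placeholders for the two categories of x and of t.

```python
# Hypothetical counts keyed by (x category, t category, y category).
# They are constructed so that y depends only on t, while x affects t.
counts = {
    ("x1", "t1", "high"): 40,  ("x1", "t1", "low"): 160,
    ("x1", "t2", "high"): 640, ("x1", "t2", "low"): 160,
    ("x2", "t1", "high"): 160, ("x2", "t1", "low"): 640,
    ("x2", "t2", "high"): 160, ("x2", "t2", "low"): 40,
}


def proportion_high(x, t=None):
    """Proportion with a high y among cases in category x (optionally within t)."""
    high = total = 0
    for (xi, ti, yi), n in counts.items():
        if xi == x and (t is None or ti == t):
            total += n
            if yi == "high":
                high += n
    return high / total


# Initial [xy] relationship, ignoring t.
initial_difference = proportion_high("x1") - proportion_high("x2")

# [xy;t] relationship: the same comparison within each category of t.
partial_differences = {
    t: proportion_high("x1", t) - proportion_high("x2", t) for t in ("t1", "t2")
}

print("initial difference:", round(initial_difference, 2))
print("differences within categories of t:", partial_differences)
```

With these made-up counts the two categories of x differ markedly in the proportion of high scores, yet the difference is zero within each category of t, which is the pattern the text describes for an intervening variable.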

It should be noted that the interpretation of the same data may vary greatly, depending on our general knowledge of the time order of the events. In the case of the fire engine example we concluded that the number of alarms (t) causes the number of fire engines (x), i.e., t → x. In the present example, however, we conclude that race (x) causes degree of education (t), i.e., x → t. Also, we previously concluded that the number of fire engines (x) does not cause the amount of damage (y), i.e., that the causal relationship between the two is spurious. In the present example, however, we do not find that the relationship between x and y is spurious; rather, we find that x and y are causally interrelated through an intervening variable (t). Specifically, race (x) affects degree of education (t), and degree of education affects IQ score (y). We might conclude that, in the analysis of survey data, it is necessary to use whatever knowledge is available about the phenomena under investigation to make assumptions about the time order of the events involved.

Thus far we have distinguished two different results that might occur when we have an initial relationship between x and y and then control on a third variable (t):

1. As in the case of Table 16-[number illegible], the initial relationship between x and y is maintained within each of the categories of the control variable.

2. As in the fire engine example, the initial relationship between x and y disappears or diminishes within each of the categories of the control variable. This result may be interpreted in two different ways, on the basis of one's knowledge of the time order of the events:

(a) Interpret t as an intervening variable between x and y:

x → t → y

(b) Interpret the initial relationship between x and y as spurious:

x ← t → y

* Actually, of course, a very complex series of events intervenes between race and education. This causal process seems far more complex than the causal process which related the flipping of a light switch and the light's turning on.



As we saw in the fire engine problem, such an interpretation is incorrect, for the number of alarms (t) in all probability does not cause the amount of damage (y). We were led to the introduction of a fourth variable, the initial seriousness of the fire (m), which was conceived of as causing both t and y:

[Diagram: m → t → x, and m → y]

Such additional complexities are quite likely in the analysis of the causal relationships among any set of three variables.


16 4 THREE VARIABLES: AN ILLUSTRATION OF A 
THIRD POSSIBILITY 


There is a third type of result that can occur when we have an initial relationship between x and y and then control on a third variable:

3. The initial relationship between x and y disappears or diminishes within one of the categories of the control variable but is maintained or intensified within the other category.

This third possibility will be illustrated by means of actual rather than hypothetical data. We shall utilize the example from Section 13.4, dealing with student grades on a class assignment. Let us designate the students' degree of interest in the class assignment as x, their grade on the assignment as y, and their seating position as t. According to the tables presented there, the relationship between x and y and between t and y seems to be quite definite. We proceed to utilize the t-test for the significance of a difference between proportions on each of these tables, with H1 being one-sided and α = .10. For each of the tables, the relationship proves to be statistically significant; i.e., the relationship between interest and grade, and between seating position and grade, is statistically significant. We might now test for the relationship between interest in the assignment (x) and seating position (t). The relevant data are presented in Table 16-10:

TABLE 16-10
[tx]: THE RELATIONSHIP BETWEEN SEATING POSITION (t) AND DEGREE OF INTEREST IN A CLASS ASSIGNMENT (x)
(Table body illegible in this copy. Columns: seating position, first three rows and Rows 4-15. Rows: Interested, Not Interested.)

There is no statistically significant relationship between x and t, in spite of the percentage difference of 12 percent. A more exact statement is that a 12 percent difference could have occurred with a probability greater than .10 from a population where the true difference in proportions (P1 - P2) is equal to 0. Schematically, the following relationships have been established thus far:

[Diagram: x related to y; t related to y; no statistically significant relationship between x and t]
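The statement about a 12 percent difference can be made concrete with a rough calculation. The sketch below is an assumption-laden illustration, not the author's computation: it uses a normal approximation to the difference between two proportions rather than the t-test named in the text, and the counts (11 of 22 versus 8 of 21) are hypothetical values chosen only to give proportions about 12 percentage points apart.

```python
import math


def one_sided_p_value(k1, n1, k2, n2):
    """One-sided p-value for H1: P1 > P2, using a pooled normal approximation."""
    p1, p2 = k1 / n1, k2 / n2
    pooled = (k1 + k2) / (n1 + n2)
    standard_error = math.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    z = (p1 - p2) / standard_error
    # Upper-tail probability of the standard normal distribution.
    return 1 - 0.5 * (1 + math.erf(z / math.sqrt(2)))


# Hypothetical counts: 11 of 22 versus 8 of 21 (a difference of about 12 points).
p_value = one_sided_p_value(11, 22, 8, 21)
alpha = 0.10
print("one-sided p-value:", round(p_value, 3))
print("significant at the .10 level:", p_value < alpha)
```

Under these assumed counts the probability is well above .10, so a difference of this size in samples this small would not be judged statistically significant.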


Let us now proceed to investigate the causal process involved by looking at the interrelationship within a three-variable table. In Table 16-11, we control on seating position (t) and examine the relationship between interest and grade within each of the categories of seating position.


This table differs from the previous ones in that the figures are frequencies and not percentages. Percentages would be misleading with frequencies as small as these. We shall perform tests of significance on the left-hand fourfold table and on the right-hand fourfold table. The tests will be one-sided because our alternative hypothesis is that the proportion of high grades is greater among the interested than among the uninterested students. In addition we shall specify that α = .10.

TABLE 16-11
[xy;t]: THE RELATIONSHIP BETWEEN DEGREE OF INTEREST IN A CLASS ASSIGNMENT (x) AND GRADE (y) CONTROLLING ON SEATING POSITION (t)
(Table body illegible in this copy. Columns: first three rows and Rows 4-15, each divided into Interested and Not Interested.)

The left-hand table results indicate a statistically significant relationship at the .10 level.* As for the right-hand side, the relationship is not statistically significant at this level.† Thus, we find a relationship between interest and grade among students in the first three rows but not among students in the fourth to fifteenth rows. This illustrates the third possible result when we have an initial relationship between x and y and then control on a third variable t. In the present example, a statistically significant relationship between x and y no longer exists within one category of the control variable while it is retained in the other.

Such mixed situations are difficult to interpret. Often they force the investigator to reconsider his original ideas and come up with a new formulation. In the present situation, the frequencies are so small that it is difficult to make a definitive interpretation. Overall, however, there seems to be limited evidence to the effect that degree of interest (x) and seating position (t) operate independently of one another at least to some extent.

* For this table, the sample is too small to utilize the t-test. The test used was the chi-square test, corrected for continuity. For details see, for example, Sidney Siegel, Nonparametric Statistics for the Behavioral Sciences (New York: McGraw-Hill Book Co., 1956, pp. 104-11).

† The frequencies here make it inappropriate to use the chi-square test. Consequently, the Fisher exact probability test was utilized. For details, see Siegel, op. cit., pp. 96-104.

In further exploring the data, let us first examine the percentage of individuals with high grades who are both interested and seated in the first three rows (the two factors most conducive to high grades). Among these 11 students we have 8 (or 8/11 = 73 percent) with high grades. Of the 13 students who are not interested and are seated in the fourth to fifteenth rows, [figure illegible] percent have high grades. Finally, among the 19 students who have one factor favorable for high grades, we have 38 percent with high grades. This great increase in the percentage of high grades as one factor is combined with another provides evidence for the independent operation of t and x on y.

In evaluating the technique of three-variable analysis within the context of non-experimental research, it is apparent that it is possible to approximate the idea behind Mill's method of difference by controlling on additional variables. Results may be quite useful in shedding light on the causal process involved. The survey techniques of analysis are analogous to those used in the before-and-after experiment with one control group.
Some Limitations of Survey Data

But there are also important limitations in these survey procedures:

1. In the experiment with randomization one can effectively equate the experimental and control groups on a large number of factors simultaneously. Within the three-variable groupings for non-experimental data, however, we only equate within the two categories of the independent variable on one variable at a time. Thus, it is the experiment that comes much closer than the survey to the specification of Mill's method of difference, that the groupings be equated on all other factors.

2. Within the experiment, events are directly manipulated by the investigator, whereas no such manipulation generally occurs in the survey. Without such manipulation, the investigator has less to go on in his attempt to determine the time sequence of events within a given causal process.


Some Strategies of Survey Analysis

There are also strategies of research for easing the impact of these fundamental limitations. Some of these are as follows:

1. Certain statistical procedures too technical for this book, such as multiple and partial correlation and the analysis of covariance, aid the researcher in dealing effectively with more than three variables at a time. If the analysis becomes complicated, however, an effective theory becomes very important for guiding the researcher within the maze of resulting data.

2. With large samples, it is possible to control on several variables simultaneously, as was illustrated in the case of one control variable. Here, as above, effective theory is needed. It seems to be desirable to perform a series of three-variable analyses before proceeding to multivariable analysis.

3. By retrospective questions, or by interviewing respondents over a series of points in time (called the panel technique), the researcher can learn a good deal about the order in which events occur.

4. In dealing with the above two problems, the tool most essential for the scientist is his willingness to make assumptions about uncontrolled variables and the sequence of events in spite of his lack of adequate knowledge about them.* Without such willingness, he could not make any decisions about the nature of the causal process. For example, it is usually possible, when survey data are used, that a variable which is treated as an effect of another might actually be a cause of it, or that the two variables might cause one another. Also, many other variables may play important roles in the causal process. They may be involved as intervening variables, or they may lead to the interpretation that the relationships under investigation are spurious. If the investigator is willing to make provisional assumptions about the time sequence involved (making use of his general knowledge insofar as possible) and about the lack of influence of variables that are not taken into account, he can get on with the investigation. In this way he may subsequently find out a great deal about the time sequence involved, as well as about the role of other variables. If the investigator is unwilling to make such assumptions, if he is overly concerned with the limitations of his study, then he may fail to make any progress whatsoever.


* For a further discussion of these kinds of assumptions, see Hubert M. Blalock, Jr., Social Statistics (New York: McGraw-Hill Book Co., 1960, pp. 337-43).

SELECTED REFERENCES FOR FURTHER STUDY

HUBERT M. BLALOCK, JR., "Correlational Analysis and Causal Inferences," American Anthropologist, LXII: 624-31 (1960). The author provides a clear presentation of the ideas of Herbert A. Simon on spurious correlation. These ideas are then applied to correlational data on the North American Indian. Particular attention is given to four-variable analysis.

HERBERT HYMAN, Survey Design and Analysis (New York: The Free Press of Glencoe, 1955). The major sections are: the orientation of the survey analyst, descriptive surveys and the functions of the analyst, explanatory surveys, and the utilization of survey findings. Within the context of [...], the reader is taken through a wide variety of procedures utilized by survey analysts.

SAMUEL A. STOUFFER, Social Research to Test Ideas (New York: The Free Press of Glencoe, 1962). The introduction to this volume describes it as providing "for a new generation of students [...] through which they can understand the development and the present state of empirical social research." Topics include reference groups, a theory of intervening opportunities, the effects of the depression, the case study, and correlational analysis.



SOME APPLICATIONS OF LOGIC AND MATHEMATICS 
[...] of reality which appear significant to human beings and their decisions [...].

Although mathematics may be approached as the mathematician approaches it, our primary concern here is with its use by the community of scientists. The language of mathematics possesses certain special advantages over everyday speech. There is only limited agreement about the meanings of words among the users of ordinary language. Words tend to have a wide variety of connotations; in ordinary language, thus, the uniformity of usage is limited. Among scientists who communicate in the language of mathematics, however, this difficulty of communication almost completely disappears: mathematics constitutes an international language which is neither vague nor ambiguous.

One reason for the effectiveness of mathematics in the communication process is that it is relatively divorced from the labels or symbols used in everyday speech. This poses the difficulty of learning a new language. Actually, this is no small difficulty, and it sets limits on the number of individuals who can effectively make use of mathematics. But along with this difficulty comes the tremendous advantage of uniformity of usage. Because everyday speech is not involved, the vagueness and ambiguity associated with it are not present. It is this vagueness and ambiguity which presently plague teachers of the behavioral sciences in their effort to get across concepts phrased in ordinary language.

Even more harmful for the scientific process is the fact that vagueness and ambiguity are conveyed among scientists as well. A given word may be assigned a specific definition by a given scientist, but the definition itself consists of words, each of which has a degree of vagueness and ambiguity. In this way, the investigator may unwittingly skirt any decision among possible alternative definitions. Ordinary language permits him to avoid making such decisions by hiding in what has been called the penumbra of vagueness attached to a given word.

This might be illustrated by a leading textbook's definition of role (sometimes called social role): ". . . a pattern of behavior associated with a distinctive social position, e.g., that of father, teacher, [...]. Most roles specify the rights and duties belonging to a social position; they tell the individual what he ought to do in his role as father, or teacher, to whom he has obligations, and upon whom he has a rightful claim."†

* This idea might be carried further through an exploration of the ways in which the values of a given society affect the development of its language. The idea is not limited to the societal level. For example, we can also investigate the development of scientific language in the community of scientists. We can even examine the idea that each individual selects from the available concepts a "private language" which he sees as best serving his own purposes. Such a private language might involve a further specification of the meanings of words than is provided in the vague, generally accepted meanings implied by ordinary usage.

† Leonard Broom and Philip Selznick, Sociology (New York: Harper and Row, 1963, p. 16).



It seems that there is an ambiguity in this definition, and the two meanings implied represent quite different approaches to the concept of role. In the first sentence the reference seems to be to the uniformities of behavior displayed by the many different occupants of a given social position, e.g., the actions which all or most fathers have in common. The second sentence, however, refers to the "rights and duties" involved, i.e., the expectations as to how an individual should act or behave rather than how he actually does act. A definition along the lines of the first sentence of the quotation would lead to circular reasoning if we attempted to explain uniformities of behavior by means of a concept of role defined in terms of uniformities of behavior. Thus, in attempting to explain the behavior of students in coming to class on time by saying that this is part of the role of student, we would in effect be saying that students come to class on time because they come to class on time.

A definition of role in terms of rights and duties does not, on the other hand, lead to such circularity. The expectations of students with respect to the utility of coming to class on time are not the same as their overt behavior, as is indicated by the fact that many students do not conform to this expectation.

Mathematical language forces the scientist to eliminate such ambiguities. The verbal languages are desperately needed to provide specific "content" with respect to the explanation of phenomena, but mathematics might well be used to restate non-mathematical propositions and theories. If the original non-mathematical theory is any good, a mathematical reformulation of it may remove various ambiguities and reveal new issues.

This brings us to the second characteristic of mathematical language. The special potential of mathematics for the advancement of science seems to be a function of its high level of abstraction as well as its clarity. We have seen, in discussing propositions and theory, that a high level of abstraction is generally associated with wide scope. Because mathematics are so very abstract, we might very well expect them to have considerable scope. As a matter of fact, the scope of mathematics is such that the same kinds of formulations are useful for different sciences. This is the case with respect to probability theory. We shall see an illustration of its use in the reformulation of the theory of choice behavior used in the medical student study.

The clarity of mathematical language, so valuable for purposes of scientific communication, also enables the scientist to deal with complexity. Theory has been defined as a system of propositions, and mathematics is particularly well adapted to stating the relationships among a number of variables in a clear-cut fashion. Moreover, the use of mathematics encourages the scientist to think in a more systematic way than he ordinarily might. If it is true that theory is the most important tool for the development and testing of explanations and predictions, and if the language of mathematics is a highly appropriate one for stating theory, then mathematics becomes extremely useful for the scientific process.


Some Limitations of Mathematical Formulations

Mathematics is not, of course, the only language for stating theory, and statements of theory in ordinary language are by no means old-fashioned. The language of mathematics constitutes a tool for further developing a given theory; but if the theory itself is not a good one, no amount of mathematics can improve it. Mathematics is not to be viewed as an automatic means for the explanation of phenomena. Sometimes the very complexity and sophistication of the mathematical techniques utilized is seen as self-justifying. Yet such complexity can be a disadvantage, for difficulties emerge in attempts to test a highly complex formulation. Furthermore, although a more mathematically sophisticated formulation might be of interest to a mathematician, this is certainly no criterion for evaluating theory. Mathematical complexity is justified only insofar as it aids in formulating statements of the systematic interrelationships among propositions. If this end is as easily served by very simple formulations, so much the better.


The Language of Logic

The language of logic is very similar to that of mathematics. Like mathematics, logic is highly abstract and has no reference to any particular kind of phenomena. For example, the principle of the excluded middle is that any proposition must be either true or false. The statement does not refer to any particular types of propositions, but to propositions in general. Its utility may be judged by giving it specific content and attempting to apply it, as is quite frequently done in many different aspects of the scientific process. For example, it is common scientific procedure to set up a null hypothesis and an alternative hypothesis. A decision to reject the null hypothesis entails a decision to accept the alternative hypothesis. This procedure involves a utilization of the principle of the excluded middle, in which the null hypothesis is viewed as being either true or false. Our assumption, in accordance with the principle of the excluded middle, is that the null hypothesis is either true or untrue, and that if it is untrue then the alternative hypothesis must be true.

But is the proposition that light illustrates the behavior of particles true or false? In some situations it is true; in others, false (e.g., the phenomenon of the diffraction of light, where light illustrates the behavior of waves as opposed to particles). When the principle of the excluded middle is given specific content so that it may be applied to such propositions, it does not seem to hold. Similarly, in mathematics, many of the theorems of Euclidean plane geometry prove false when interpreted so as to apply to the surface of a sphere.

Both logic and mathematics have been used by the sciences in statements of theories. Both are also useful for the methods of science (i.e., the methods of determining the nature of reality), where they function primarily within the context of justification rather than that of discovery. According to Cohen and Nagel:

Logic may be said to be concerned with the question of the adequacy or probative value of different kinds of evidence. Traditionally, however, it has devoted itself in the main to the study of what constitutes proof, that is, complete or conclusive evidence.*

Mathematics, too, has had similar applications in the context of statistical decision making as it applies to the assessment of evidence.

Of course, there is no reason why both logic and mathematics cannot be applied to the context of discovery. Some of the applications of mathematics were presented in Section 14.1, in the discussion of decision making. And Kaplan presents one argument for a logic of discovery:

The point is, however, that "invention" can be cultivated. Though the scientific enterprise has a significant element of luck in it, it is not wholly a game of chance, and scientific training surely enhances in some degree the skill of the players. The "logic of discovery" is, so to say, the strategy of playing the game.†

At present there is considerable enthusiasm among behavioral scientists for the utilization of logic and mathematics.‡ This has resulted in part,

* Morris R. Cohen and Ernest Nagel, An Introduction to Logic and Scientific Method (New York: Harcourt, Brace & World, 1934).

† Abraham Kaplan, The Conduct of Inquiry (San Francisco: Chandler Publishing Co., 1964, p. 16).

‡ The following is a list of some of the collections of materials which have appeared within the past few years, presented in order of appearance: Joseph Berger, Bernard P. Cohen, J. Laurie Snell, and Morris Zelditch, Jr., Types of Formalization in Small Group Research (Boston: Houghton Mifflin, 1962); John G. Kemeny and J. Laurie Snell, Mathematical Models in the Social Sciences (Boston: Ginn and Co., 1962); Joan H. Criswell, Herbert Solomon, and Patrick Suppes (eds.), Mathematical Methods in Small Group Processes (Stanford: Stanford University Press, 1962); R. Duncan Luce, Robert R. Bush, and Eugene Galanter, Handbook of Mathematical Psychology, Vols. 1 and 2 (New York: John Wiley and Sons, 1963); James S. Coleman, Introduction to Mathematical Sociology (New York: The Free Press of Glencoe, 1964); George A. Miller, Mathematics and Psychology (New York: John Wiley and Sons, 1964); Fred Massarik and Philburn Ratoosh (eds.), Mathematical Explorations in Behavioral Science (Homewood, Ill.: Richard D. Irwin and The Dorsey Press, 1965). For a recent treatment of topics in mathematics which are of special interest to behavioral scientists, see Robert McGinnis, Mathematical Foundations for Social Analysis (Indianapolis: The Bobbs-Merrill Company, 1965). For a text which explores the application of mathematics to managerial decisions, see Chris A. Theodore, Applied Mathematics: An Introduction (Homewood, Ill.: Richard D. Irwin, 1965).







m»s.raao„s. sc.cc, ad for dr6r« >“ 

theory of choice aid in iu development, 

slatemcnls (lopeal or malhcmaticnl) may 

17.2 THE LOGIC OF CHOICE

A Description of an Experiment

Let us examine an experiment on a single individual, which took place under highly controlled conditions. The subject was asked to predict whether a plus symbol or a check symbol would occur on each of 200 trials, and he was instructed to make as many correct predictions as possible. After he made each prediction, he was told which symbol did occur. The selection of each event was determined by a random process, but the subject was not aware of this.

The aim of the experiment was to investigate the cognitive process which led to the subject's predictions of the set of events. The technique for collecting data is called the thinking-aloud procedure. The subject is simply instructed to verbalize as he is performing the task, and the protocols which emerge deal directly with the task.

The cognitive processes involved in the subject's choices proved to be quite complex. The decision-making process consisted of two stages: the explanation of the event which had most recently occurred, and the prediction of the next event on the basis of this explanation as well as other factors. Because the subject was not informed that the symbols presented to him were generated by a random process, he assumed that it was in fact possible to "explain" the occurrence of a given symbol. In part, his cognitive process represented an attempt to understand and predict the behavior of the experimenter, who he believed to be making the decision as to which symbol should appear on a given trial.
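The structure of the experiment can be sketched in a few lines of code. This is a minimal illustration under stated assumptions, not a reproduction of the study: the probability of a plus symbol (`p_plus`), the seed, and the subject's rule (expect the previous symbol to repeat) are all hypothetical choices made only to show how predictions are scored against a random sequence.

```python
import random


def run_experiment(n_trials=200, p_plus=0.5, seed=0):
    """Simulate n_trials of the plus/check prediction task.

    p_plus and the 'expect a repeat' rule are assumptions for illustration.
    """
    rng = random.Random(seed)
    events, predictions = [], []
    last_event = "+"  # arbitrary prediction for the first trial
    for _ in range(n_trials):
        prediction = last_event  # hypothetical rule: expect the last symbol again
        event = "+" if rng.random() < p_plus else "check"  # random process
        predictions.append(prediction)
        events.append(event)
        last_event = event  # the subject is told which symbol occurred
    correct = sum(p == e for p, e in zip(predictions, events))
    return events, predictions, correct


events, predictions, correct = run_experiment()
print("proportion of correct predictions:", correct / len(events))
```

Because each symbol is drawn independently of the ones before it, no rule based on past symbols can be expected to do better in the long run than always guessing the more frequent symbol, even though the subject experiences the sequence as something to be explained.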

An Explanation of the Binary Choice Process in the Experiment

Analysis of the protocol indicated that the subject treated each event as an element in one or another of a set of possible patterns, e.g., . . . ✓✓✓✓ . . . ; . . . ++++ . . . ; . . . ✓✓++✓✓++✓✓++ . . . . In the first stage of the decision-making process, the subject selects one of these patterns as the basis for explaining what happened with respect to the event that most recently occurred. Let us call this the event at trial t. Preceding events will be designated the events at trials t - 1, t - 2, and so on.

We may diagram the process whereby the subject arrives at an explanation of the event at trial t as a series of choices, each of which implies a set of options or alternatives for future choices. The following is a diagram, adapted from Feldman,* of these choices:

[Diagram: choice points A, B, and C leading to explanations Z1, Z2, and Z3]

* Julian Feldman, "Computer Simulation of Cognitive Processes," in Harold Borko (ed.), Computer Applications in the Behavioral Sciences (Englewood Cliffs, N.J.: Prentice-Hall, 1962, pp. 336-59).


The diagram shows that the subject can arrive at three explanations for a
given event at trial t: Z1, Z2, and Z3. Having arrived at one of these
explanations of the most recent event, he must then decide whether or not
he should "guess opposite" to this explanation during the next trial.*
This choice process is somewhat more complex than the explanation process
for trial t and will not be discussed here.

The subject's initial choice (see situation A in the diagram) is where he
evaluates the pattern he used in predicting the event at trial t. If the
pattern itself could have successfully predicted the event for trial t,
he chooses explanation Z1 (which is simply the explanation that the event
is an instance of this pattern). For example, if the pattern was a
progression of ✓'s, and the event was a ✓, then he arrives at Z1 as his
explanation. If, however, the pattern could not have successfully
predicted the event for trial t, then the subject moves to B, where he
evokes those patterns that could have predicted the event successfully,
and he focuses on patterns that would have predicted the event at t − 1
as well as the event at trial t. Having evoked a set of patterns, the
subject then moves to C, where he makes a choice among them. He selects
that pattern which he had selected most

•To b«^ more preeive. itKre j -fuew oppowie co ^P**"*' 

subject ia the eue>s-oppofail« rr«<»: ?'* , ©rpoM'e" to ihe piiwrn 

lion ot .he even. a. iml /: l2) in n«e lha. the ‘‘ "f , 

uveJasihebamof .hee»rl»"»“«"*‘“*^ . -Thro* r« oft’ evpUMUon). (3) uie ihe 

av the explanation, for .he Uuer mifi . exent »i 1'“' * I* ,, 

txplanauon of ihc even, at .rial » w Pi^‘ to .he unvucccufol « 

•It ihoulJ be noied that the owrectly ihe evtnn of tiuU / I. ~ 

trial I iv alio evoked if n could nave pre 


often on all of the preceding trials. Moving to D, he then decides
whether or not the pattern selected was the basis for his unsuccessful
prediction for trial t. If it was not, then he chooses explanation Z2
(i.e., that the event at trial t is an instance of this pattern). If,
however, the pattern selected had just been utilized in the unsuccessful
prediction at trial t, then the subject chooses explanation Z3 (this
explanation of what occurred at trial t is that "the experimenter tried
to throw me off").

After arriving at an explanation for the event or experience at trial t,
the subject then goes through a complex process in which he decides
whether or not to "guess opposite" to this explanation. Finally, he
arrives at a prediction of a + or a ✓ for the next trial.

The foregoing explanation of the process through which the subject
chooses a + or a ✓ for a given trial was developed by the experimenter on
the basis of the "think-aloud" protocols as well as the sequence of
symbols in a series of trials. Whether or not this explanation is correct
is not known, but some evidence may be provided by an attempt to predict
the entire series of choices made by the subject. Although in common
usage the concept prediction refers to events that have not yet occurred,
the behavioral scientist sometimes uses it in a different sense, to refer
to an explanation which reproduces the choices actually made. Thus,
"prediction," in this respect, is analogous to a high coefficient of
reproducibility in the context of Guttman scaling.
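The explanation process described above (situations A through D) can be
sketched as a small program. This is only an illustrative sketch, not
Feldman's program: the three patterns, the function name explain, and
the rule that the unsuccessful pattern is evoked again when it would have
predicted trial t − 1 are assumptions made for the example.

```python
from collections import Counter

PLUS, CHECK = "+", "✓"

# Hypothetical pattern set: each pattern maps the earlier events
# to the event it would predict next.
PATTERNS = {
    "run_of_pluses": lambda past: PLUS,
    "run_of_checks": lambda past: CHECK,
    "single_alternation":
        lambda past: CHECK if past and past[-1] == PLUS else PLUS,
}


def explain(events, used_pattern, selection_history):
    """Return (explanation, pattern) for the event at trial t.

    events            -- all events through trial t (events[-1] is trial t)
    used_pattern      -- pattern on which the prediction for trial t was based
    selection_history -- names of the patterns selected on preceding trials
    """
    past, event = events[:-1], events[-1]

    # Situation A: could the pattern just used have predicted the event?
    if PATTERNS[used_pattern](past) == event:
        return "Z1", used_pattern

    # Situation B: evoke the patterns that could have predicted the event.
    evoked = [name for name, rule in PATTERNS.items() if rule(past) == event]
    # Assumption: the unsuccessful pattern is also evoked if it would
    # have predicted the event at trial t - 1.
    if past and PATTERNS[used_pattern](past[:-1]) == past[-1]:
        evoked.append(used_pattern)

    # Situation C: choose the evoked pattern selected most often before.
    counts = Counter(selection_history)
    chosen = max(evoked, key=lambda name: counts[name])

    # Situation D: was the chosen pattern the one that just failed?
    if chosen == used_pattern:
        return "Z3", chosen   # "the experimenter tried to throw me off"
    return "Z2", chosen       # the event is an instance of another pattern
```

The guess-opposite stage that follows the explanation is not modeled
here, since the text does not describe it in detail.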

Although it would have been possible for the experimenter himself to make
"predictions" on the basis of the explanation given, together with the
"guess-opposite" explanation, for each of the 195 trials run (after a
preliminary series of five trials), this would have been quite laborious.
To make the task feasible, an electronic computer was programmed so as to
use these explanations in predicting the subject's choice on each trial.
Here the computer functioned as a kind of "logic machine," simulating the
logical operations of the subject in making his choices.

One potential difficulty in this process arises when a given prediction
by the experimenter (or by the computer program) is erroneous. If the
program's erroneous prediction is assumed by the computer to be the actual
choice of the subject, then the original error would be compounded, for
the computer would be operating on the basis of incorrect information.
The greater the number of errors made, the greater would be the
compounding of those errors, and the result would be a highly inaccurate
series of predictions.

This difficulty may be avoided by introducing the idea of conditional
prediction. Whenever the computer's prediction differed from the
subject's, the subject's prediction was substituted for that of the
computer. Of course, this was still counted as an error for the
experimenter's explanation (or the computer program). By this technique,
however, no given error was allowed to be compounded into other errors.
As a result, when the 195 conditional predictions of the computer program
were evaluated, 193 were correct and only two were incorrect.
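The difference between conditional prediction and letting errors compound
can be illustrated with a short sketch. The predictor used here (repeat
the last choice) is a hypothetical toy rule, not the experimenter's
program, and the five choices are invented for the illustration.

```python
def score(predict, actual_choices, conditional=True):
    """Count correct and incorrect predictions over a series of trials.

    predict(history) returns the predicted choice for the next trial.
    With conditional=True the subject's actual choice is fed back after
    every trial, so a wrong prediction is counted once and then replaced.
    With conditional=False the program's own predictions are fed back,
    so an early error can distort every later prediction.
    """
    history, errors = [], 0
    for actual in actual_choices:
        guess = predict(history)
        if guess != actual:
            errors += 1
        history.append(actual if conditional else guess)
    return len(actual_choices) - errors, errors


def repeat_last(history):
    """Hypothetical toy rule: predict whatever was recorded last."""
    return history[-1] if history else "+"


actual = ["+", "✓", "✓", "✓", "✓"]
print(score(repeat_last, actual, conditional=True))   # (4, 1)
print(score(repeat_last, actual, conditional=False))  # (1, 4)
```

In the conditional run the single wrong prediction stays a single error;
in the unconditional run the same rule keeps building on its own mistake.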

It is not only the experimenter's explanation of the subject's final
choice of a symbol for a given trial which may be tested, but intermediate
explanations as well. For example, one of the three possible explanations
that the subject may develop for the most recent event is Z1 (i.e., that
the pattern which was the basis for his prediction at trial t actually was
the correct pattern). Actual data are available from the subject (based on
his think-aloud protocol), which indicate whether he in fact came to this
conclusion when it was warranted. In 117 out of 120 trials where the
pattern previously used would in fact have been the basis for a correct
prediction, the subject retained this pattern as the appropriate
explanation. This constitutes an intermediate check on the experimenter's
analysis.


The Import of the Experiment 

This binary-choice experiment illustrates a number of different points
with respect to the language of logic in particular, and the research
process and behavioral science theory in general.

1. A rather obvious difference between this logical formulation and a
mathematical one is that the former involves very little use of
mathematical symbols or mathematical operations. There is one exception,
which is involved in the subject's attempt to explain what occurred at
trial t and which occurs in situation C (see diagram). Here the subject
selects one pattern from among a set of possible patterns, and the basis
of his choice is a numerical comparison. In particular, he selects the
pattern he had selected most often on all of the preceding trials.

2. The process by which the subject explained the event at trial t, as
illustrated by the diagram, is similar to the diagrams of causal
processes presented earlier. One difference has to do with the fact that
the present situation does not focus on the process leading to one
possible outcome. In the electric-light example, we were concerned with
the process leading to the lamp turning on, and no alternatives were taken
into account. Here, however, three different alternatives may occur. In
the following diagram, we will focus on only one of the Z's so as to
demonstrate the relationship between the two types of diagrams:



[Diagram: A → B → C → D → Z3]



Here, A, B. C. D, und Z. ita Treptos Z^rd'F 

the basis for his prediction of the ^ situation B, he must 

tale ieto account. In order for the subject to z.. Also, in 

avoid this choice; othervrise, he urtll come up h f 

moving from D to Z, a “"““S’ 1 selected was just utited 
refers to the subjeefs decision that the PK' " ^ 

in his unsoecesstul prediction of the back and forth 

3. Computer simulation in this experiment involved moving back and forth
between the researcher's explanation and the data he is attempting to
explain and predict. The fundamentals of this research procedure, in which
hypotheses and theories are subjected to test, are no different from those
involved in other types of research.

4. This experiment illustrates the testing of a system of propositions, or
a theory, rather than one or a series of isolated propositions. The theory
involved is quite similar to the theory of choice behavior, as may be
understood if the research situation is analyzed for correspondences
between the two.

First, the subject is instructed to attempt to maximize the number of
correct predictions. Thus, if we assume that this instruction is followed,
only one goal on the part of the subject guides his choices. He then makes
a series of choices, and each choice would seem to be based on his
expectations as to how he might best achieve this goal. In situation A,
for example, the subject is deciding between two alternatives: Z1 and B.
Z1 is chosen if the pattern he previously chose to be the basis for his
choice proves to have been an accurate basis for his choice. Thus, this
pattern was instrumental to the subject's being able to achieve his goal
of accurate prediction at trial t, and the subject seems to make the
assumption that it will continue to be effective for trial t + 1. On the
other hand, if the pattern proves to have been an inaccurate basis for his
choice at trial t, he will move into situation B and proceed to look for
other patterns.

It should be noted that the important expectations of the subject have to
do with two types of phenomena: (1) his expectations as to which one among
several alternatives provides the best chances for achieving his goal of
accurate prediction, and (2) his expectations as to what alternatives or
options exist in a given situation. We may thus classify the sequence of
behavior at each point along the road to final choice as follows: (1) his
expectations as to the alternatives or options. In situation A, for
example, the alternatives are B and Z1. (2) his expectations as to which
among the alternatives will provide the best chances for achieving his
goal of accurate prediction. These expectations will depend on the
specific criteria he has set up for evaluating the probability that each
alternative will lead to an accurate prediction. (3) his choice among the
alternatives which he has set up. With respect to situation A, for
example, the subject chooses between B and Z1.

5. Unlike investigations such as the medical student study, this
experiment has special advantages for the uncovering of the time order of
events. By means of the "think-aloud" protocol, intermediate states
leading up to each choice may be investigated. For example, it is possible
to determine whether the subject actually arrives at the explanation which
the theory predicts for him. On the basis of such determinations, the
theory that applies to different aspects of the causal process may be
checked out, and we are not limited to an all-or-none decision as to
whether the theory is correct or incorrect.

6. This experiment provides an opportunity to test the theory on a large
number of choices by a single individual. In contrast, the medical student
study focused on one choice which was made by a large number of
respondents. It is rather unusual to test a theory utilizing data from
only one individual, but it is possible. The results would, of course, be
more impressive if the same computer program were able to successfully
predict, in a subsequent series of 195 trials, 193 of them. Nevertheless,
the unit of investigation is not the individual but the choice he makes,
and the relevant number of units is not 1 but 195. There are both
advantages and disadvantages associated with focusing an investigation on
one individual. For example, it is entirely possible that the sequence of
events may have been relatively unique to that particular individual.
Thus, the same program might not work well for many other individuals. The
theory, however, is more general than the program. Thus, quite different
programs developed for various individuals might have in common the same
general theory. For example, individuals may set up different alternatives
for themselves and have different expectations as to the impact of
choosing a given alternative on the success of the prediction at trial t.
Yet, even with such differences, these individuals all might be attempting
to maximize expected utility in each of their decisions.

A possible advantage of focusing on the behavior of the individual is that
there may be no substitute for getting at "the definition of the
situation,"10 if we are to explain and predict his behavior. In his
presidential address, "Bringing Men Back In," George Homans states this
position: "If a serious effort is made to construct theories that will
even begin to explain social phenomena, it turns out that their general
propositions are not about the equilibrium of societies but about the
behavior of men."11

7. By conducting the experiment under controlled conditions, the
investigator was able to achieve a great deal of closure. A

10 The concept "definition of the situation" was introduced by W. I.
Thomas in The Unadjusted Girl (Boston: Little, Brown, 1923).

11 George C. Homans, "Bringing Men Back In," American Sociological Review,
XXIX: 818 (1964).




Standard argument is that the this argument is 

the results cannot possibl^jr have ^ ^ jj^jtion js detennined by ns 

the assumption that « „„i:iving reality is comprehensive 

appearance. The seientisfs best tod for perceiving J ^^,55 

Tory. Thus, .hen an »perimental s,m« ^ .e„p= U *■"; 

must be tested in everyday settings as well, pioportion 

8. The degree of success of the computer program, in terms of the
proportion of the subject's choices which were successfully predicted,
cannot be ignored. Although the predictions were conditional predictions,
the achievement of 193 correct predictions in 195 trials is impressive. In
the context of a pragmatic approach to science, it is this kind of result
that serves to lend weight to the ideas involved in an investigation.
the student of behavioral saen« is hewildered by * t k 
researches which ate most “c ' ‘ c° racy ol predictions, 

to the magnitude ol cottelations and. in have'very'^ iillle scope 

Although high cottelations and aautate ”, J^po“i,ions^and theories 

and ibus be quite trivial, when they are attached to proposmo 
of considerable scope they may be exuemely s‘Sn»ncaDt g^-ianations 

9. There is an interesting analogy between the experimenter's explanations
and predictions, and the subject's explanations and predictions, with the
former dependent on the latter. The subject did not, however, verbalize
his general procedure for arriving at explanations and predictions, and
perhaps he was not aware of it. The experimenter's theory represents an
inference as to this general procedure, as checked against the actual
behavior of the subject.


17.3 A MATHEMATICAL FORMULATION OF A THEORY
OF CHOICE BEHAVIOR

The Galanter Study and the Medical Student Study Compared

Underlying the approach taken through much of this book is a theory of
choice behavior in which the choice among alternatives is viewed as a
function of their relative expected deprivations. The formulation of
expected value deprivation represented an attempt to provide a
mathematical statement of the theory within a specific context. Section
12.2 presented some of the work of Galanter on a mathematical statement of
the choice process, which suggests a different mathematical formulation
that might be applied to the medical student data. In this section we
shall first summarize the relevant work of Galanter and then proceed to
develop its implications for a new approach to the medical student study.
In the process, it is hoped that the process of developing a mathematical
formulation, as well as the functions of such formulations, will be
clarified. Galanter summarizes the purposes of his study as follows:

The present study was designed, therefore, to discover whether direct
measurements of utility and likelihood are possible. The study divides
itself into three parts: (1) the measurement of utility; (2) the
measurement of likelihood by three methods; and (3) the prediction of
choices made by people when we have knowledge of the likelihood and the
utility of the alternatives.12


His basic proposition is that the probability of choice of a given
alternative depends only on the difference in the expected utilities of
the alternatives. Thus, for example, Galanter attempts to predict the
choices of his subjects as between being given $1.00 if the sun rises
tomorrow and $50.00 if a penny turns up heads. He measures both the
subjective utility of the various amounts of money and the expectations of
his subjects as to the likelihood of each of the two events. He then
calculates the subjective expected utility of each of the events by taking
the product of its likelihood and utility. This product constitutes his
estimate of the probability of the choice of each event, and this
prediction may then be compared with the actual choice behavior of the
subjects.

There are three basic differences between the Galanter approach and that
used in the medical student study:


1. The measures adopted by Galanter are based on scores averaged for the
group as a whole. Also, he does not attempt to predict the choice of a
given individual; rather, he focuses on the proportion of choices within
the group as a whole. For example, the predicted probability of choice of
a given event is compared with the proportion of times it was chosen over
the alternative choice among all of the subjects. Averaging techniques for
the group were also used for the measurement of utility and the subjective
probability of events. In the medical student study, on the other hand,
the values and expectations of each student were computed separately, and
predictions as to his ratings of the medical fields also were made
separately for each student.

2. The mathematical formulation of choice as a product of subjective
probability and utility differs from the formulation used in the medical
student study. There, the formulation involved the difference between a
value rating and an expectation rating as to the opportunities for
fulfilling the value.

12 Eugene Galanter, "The Direct Measurement of Utility and Subjective
Probability," American Journal of Psychology, LXXV (1962).



3. In the medical student study, the focus was on a single decision, i.e.,
preference for medical fields. The Galanter study, however, dealt with a
number of different types of choices. More important, the Galanter study
took into account only the utility of each event as a whole rather than
the utility of its different components. In the medical student study,
differences between value scores and expectations for fulfilling the
corresponding values in a given medical field were summed. Thus, a summed
difference score was calculated for each medical student for the medical
field in question.

It would seem that each approach has its own advantages. The medical
student study took into account the individual's definition of the
situation, and the measurement of utility was based on a number of values.
The Galanter study, on the other hand, employs the concept of subjective
probability, which is a possible interpretation of the concept of
probability. This interpretation differs from the frequency
interpretation, which has resulted in many useful applications. Whether
the axioms of probability will hold for this interpretation, and whether
the theorems which follow from them will have useful applications for the
study of human behavior, remains to be seen.

A Combined Formulation

It is possible to utilize an approach which combines some of the features
of the above orientations. Such an approach could retain the emphasis of
the medical student study on performing a separate analysis for each
individual, while at the same time shifting to a substitution of
subjective probability for expectation. It could also be designed to take
into account both deprivation and fulfillment, and not merely deprivation
alone as is the case in the medical student formulation. This might be
designated as an expected utility formulation, as distinct from
calculations of expected value deprivation (EVD).

One expected utility formulation which takes into account both fulfillment
and deprivation derives from the concept of "mathematical expectation," an
idea which has long been in use by mathematicians and statisticians. We
shall first illustrate it within the simple context of a gambling game
before applying it to the medical student study. Suppose that an
individual is trying to decide whether or not to play a shell game where
he would be required to guess which of three shells a pea is under.
Suppose that if he guesses correctly he receives $10, while if he guesses
incorrectly he is required to pay $4. One question he might pose is
whether he stands to gain or lose in the long run if the game is an honest
one and if he makes his choices on a random basis.

For a given choice of a shell under these conditions, the probability that
his choice will be correct is one-third (p = 1/3), while the probability
of an incorrect choice will be two-thirds (1 − p = 2/3). He then has a
one-third chance of gaining $10 (V1 = $10) and a two-thirds chance of
losing $4 (V2 = −$4) on each guess that he makes. One might consequently
say that his expected gain for each choice he makes is one-third times $10
plus two-thirds times −$4, or pV1 + (1 − p)V2. This comes to
1/3 · $10 + 2/3 · (−$4), or $3.33 − $2.67, or $.66. Each guess then is
worth $0.66 to him, and he might expect to gain somewhere in the
neighborhood of $66 by making 100 guesses.
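The shell-game arithmetic can be checked with a few lines of code. This is
a minimal sketch; the function name expected_gain and its parameter names
are chosen for the illustration. Exact fractions are used, which shows
that the per-guess expectation is exactly two-thirds of a dollar; the $.66
in the text comes from subtracting the rounded terms $3.33 and $2.67.

```python
from fractions import Fraction


def expected_gain(p, win, loss):
    """Mathematical expectation of one play: p*win + (1 - p)*loss."""
    return p * win + (1 - p) * loss


p_correct = Fraction(1, 3)            # one pea, three shells, random guess
per_guess = expected_gain(p_correct, win=10, loss=-4)

print(per_guess)                      # exact expectation per guess, in dollars
print(float(100 * per_guess))         # expectation over 100 guesses
```

Because the expectation is positive, the player stands to gain in the long
run under the stated conditions.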

The same idea might be illustrated within the context of the medical
student study by calculating the mathematical expectation for a given
student where only one medical field and one value are concerned. Let us
assume that a given student is very interested in research (V = 3) and,
conversely, that he is negatively oriented to an equivalent extent to
being deprived of research activities (−V = −3). Let us also assume that
he feels there is a zero probability (p = 0) of being able to do research
in general practice and, conversely, that he is certain (1 − p = 1) to be
deprived of research activities in this field. Then the expected utility
of entering general practice, taking only the value of research into
consideration, is

\[ pV + (1 - p)(-V) = pV - (1 - p)V = (0)(3) - (1)(3) = 0 - 3 = -3. \]

Of course, a calculation of the expected utility for a given field would
involve repeating this procedure for each value that is relevant and then
taking the grand total. This might be expressed symbolically as

\[ \text{Expected utility} = \sum_i \bigl( p_i V_i - (1 - p_i) V_i \bigr). \]
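The summation can be written out directly. The following is a minimal
sketch under the definitions given above; the function name
expected_utility and the representation of a field as a list of (p, V)
pairs are assumptions made for the illustration.

```python
def expected_utility(ratings):
    """Sum of p*V - (1 - p)*V over the values relevant to one field.

    ratings -- iterable of (p, V) pairs, where p is the subjective
               probability of fulfilling a value in the field and V is
               the strength of the student's interest in that value.
    """
    return sum(p * v - (1 - p) * v for p, v in ratings)


# The general-practice example from the text: strong interest in
# research (V = 3) and no chance of doing research (p = 0).
general_practice = [(0.0, 3)]
print(expected_utility(general_practice))   # -3.0
```

A field with several relevant values is handled by adding one (p, V) pair
per value to the list.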

We may now inquire as to how the expected utility formulation compares
with the EVD approach utilized in the medical student study. Although the
expected utility formulation has not yet been applied to the medical
student data, we may nevertheless evaluate it in a preliminary fashion on
the basis of a few examples. Two questions in particular are immediate: Is
the expected utility approach sensitive to deprivation, as is the EVD
formulation? Is it sensitive to fulfillment, a capacity not inherent in
the EVD model?

The answer to the first question is already implied in the above
illustration of the procedure for calculating expected utility. The
situation involved an individual with great interest in research and no
chance of doing research in general practice. The expected utility
calculated was −3, with the fact that this quantity is negative indicating
deprivation. In addition, the model can differentiate between different
degrees of deprivation. Thus, the expected utility model seems to retain
the advantage of the EVD formulation in that it is sensitive to
deprivation.

As for the second question, let us consider the same individual's reaction
to the field of pathology, taking only the value of research into
consideration. Let us assume that he is certain (p = 1) of being able to
do research as a pathologist and, conversely, that there is no chance
(1 − p = 0) of his being deprived of research activities in this field.
Then his expected utility of entering pathology where only the value of
research is involved is

\[ pV - (1 - p)V = (1)(3) - (0)(3) = 3. \]

By yielding a positive score of three in a situation where fulfillment
(vs. deprivation) is involved, this model seems to be sensitive to
fulfillment. A further check on this sensitivity is whether the model
distinguishes between differential degrees of fulfillment. Let us assume
that the same student has a medium degree of interest in continued
learning while practicing (V = 2) and is certain that opportunities for
learning would be available to him as a pathologist (p = 1). Then the
expected utility which pathology has for him with respect to this one
value is

\[ pV - (1 - p)V = (1)(2) - (0)(2) = 2. \]

Thus, the model does seem to differentiate between degrees of fulfillment
by yielding the scores of three and two for situations of fulfillment of a
strong interest and fulfillment of an interest of medium strength.

This sensitivity to degrees of fulfillment was not present in the EVD
model. In the case of a student who is very interested in research (V = 3)
and who is certain of being able to do research as a pathologist (E = 3),
the expected value deprivation involved would be zero. If the student had
medium interest in continued learning (V = 2) and felt certain that
opportunities for learning would be available (E = 3), the expected value
deprivation would be zero also. This is because only those situations
where V is greater than E enter into the calculation.
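The contrast between the two formulations can be reproduced numerically.
The sketch below assumes, following the description above, that expected
value deprivation is the sum of V − E over those values for which V
exceeds E; the function names and the list-of-pairs representation are
chosen for the illustration and are not taken from the original study.

```python
def expected_utility(ratings):
    """Sum of p*V - (1 - p)*V over (p, V) pairs for one field."""
    return sum(p * v - (1 - p) * v for p, v in ratings)


def expected_value_deprivation(ratings):
    """Sum of V - E over (V, E) pairs, counting only cases where V > E.

    Assumption: this is the EVD index as characterized in the text,
    where only situations with V greater than E enter the calculation.
    """
    return sum(v - e for v, e in ratings if v > e)


# Pathology, strong interest in research, certain fulfillment.
print(expected_utility([(1, 3)]))             # 3
print(expected_value_deprivation([(3, 3)]))   # 0

# Pathology, medium interest in learning, certain fulfillment.
print(expected_utility([(1, 2)]))             # 2
print(expected_value_deprivation([(2, 3)]))   # 0
```

The expected utility scores separate the two cases of fulfillment, while
the deprivation index gives zero for both.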

The idea of mathematical expectation, which is the basis for the expected
utility formulation, is one of a class of mathematical models of decision
making which has been designed for situations where risk or probability is
involved. Luce and Raiffa distinguish between models having to do with
decision making under conditions of certainty, conditions of risk,
conditions of uncertainty (where the probabilities cannot be estimated),
and conditions where there is a mixture of risk and uncertainty.13
Although numerous mathematical models have been developed for each of
these types of situations, they generally involve procedures for
maximizing fulfillment and/or minimizing deprivation.

It is extremely easy to find many flaws in any one of these formulations.
The EVD formulation, for example, completely omitted taking fulfillment
into account. In addition, the value and expectation measurements were

13 R. Duncan Luce and Howard Raiffa, Games and Decisions (New York: John
Wiley and Sons, 1957). Game theory generally has to do with situations
where there is a mixture of risk and uncertainty, linear programming
provides an example of decision making under certainty, while statistical
decision making has to do with situations where there is uncertainty as to
the state of nature.




limited to a three-point scale, whereas some degrees of interest or of
expectation may have been many times as great as others. Furthermore, the
measurements were based on the written statements of respondents, which
may have differed from the actual situation. In spite of these
shortcomings, however, the EVD formulation yielded an index that proved to
be very closely related to the probability that students would rate a
given field unfavorably. This seems to indicate that there are advantages
involved in testing what appears to be an extremely rough model rather
than abandoning the test altogether for want of a more refined one.

The expected utility model might be tested in the same way as was the
expected value-deprivation model. Here, the procedure would be analogous
to that described earlier for the EVD formulation. A determination could
thus be made as to whether students with low expected utility for a given
field tend to rate it unfavorably, and whether students with high expected
utility for a given field tend to rate it favorably.

It should be noted that the expected utility formulation involves a sum of
products. In the discussion of ratio scales, it was stated that ratio
scales are quite useful in enabling the investigator to formulate product
relationships among variables. Thus, the development of ratio scales of
subjective probability and utility would be valuable in testing this
formulation.

One important limitation of this formulation is that it is essentially
static; i.e., it does not take into account the preliminary choices which
result in a given choice. Its advantage is that it does present a
theoretical formulation of the mechanism of any given choice. The logical
formulation within the binary-choice experiment illustrates a concern with
such sequences. There, however, although a sequence is presented whereby a
given choice is made, the development of these choices themselves must be
placed within the framework of a general theory in order to achieve a more
comprehensive explanation. To the extent that such static and dynamic
formulations are put together effectively, the behavioral scientist will
be able to explain processes occurring over a period of time.

Implications for Measurement

If a given mathematical formulation proves to be effective, it may provide
considerable aid in the measurement process itself, for effective theory
can provide us with an understanding of the measurement situation. For
example, let us assume that the expected-utility formulation is effective
in predicting an individual's choice on the basis of the initial
conditions involved, i.e., his relevant goals or values and his subjective
probability as to opportunities for achieving them within each of the
available alternatives. Then it would be possible to infer the
individual's values if we knew both his subjective probabilities and his
actual choices.



More specifically, there are three variables involved: values,
probabilities, and choices. Ordinarily we are interested in using the
first two to explain and predict the third, but we might also use the
second and third in order to predict or measure the first. This might be
done so as to provide an indirect method of measuring those values which
have considerable social desirability or social undesirability.14 This
procedure is an example of a construct-validation procedure (see the
earlier section on construct validation). The approach need not be
circular, for we need not turn right around and use the values measured to
"predict" the same choices. Rather, we may relate these measures of values
to other choices which are made by the individual. As in the general
construct-validation procedure, the efficiency of such measurements is
evaluated on the basis of the accuracy of such predictions.

Only a very few of the ways in which logic and mathematics may contribute
to our understanding of human behavior have been discussed. Indeed, the
purpose throughout this book has not been to provide an exhaustive
compendium of research methods. Innovations in this field as in others
continually take place, and the most that the student can hope for is the
ability to learn about them as they occur. If this volume can aid the
student in developing this ability and encourage him to evaluate
critically any given reconstructed logic, including the foregoing one,
then its purpose will have been served.

SELECTED REFERENCES FOR FURTHER STUDY

JAMES S. COLEMAN, Introduction to Mathematical Sociology (New York: The
Free Press of Glencoe, 1964). Although some background in mathematics is
necessary for a thorough understanding of the work, much can be learned
from this book about the developments in the field of mathematical models
without such a background. A few of the topics are: uses of mathematics in
sociology, problems of quantitative measurement in sociology, mathematics
as a language for relations between variables, and tactics and strategies
in the use of mathematics.

MARTIN SHUBIK (ed.), Game Theory and Related Approaches to Social Behavior
(New York: John Wiley and Sons, 1964). For the reader with little
knowledge of mathematics, a sound introduction to the nature of game
theory and a variety of applications is provided. Several articles of
general interest are included, e.g., Abraham Kaplan's "Mathematics and
Social Analysis" and Jacob Marschak's "Scaling of Utilities and
Probability."

14 A similar approach to the indirect measurement of utility has been
developed. See, for example, Jacob Marschak, "Scaling of Utilities and
Probability," in Martin Shubik (ed.), Game Theory and Related Approaches
to Social Behavior (New York: John Wiley and Sons, 1964, pp. 95-109).

CHRIS A. THEODORE, Applied Mathematics (Homewood, Ill.: Richard D. Irwin,
1965). The student with no college mathematics who has done well in high
school mathematics courses may here be introduced to the important
mathematical topics of set theory, algebra, analytic geometry,
differential and integral calculus, and probability. By incorporating
numerous examples from business and economics, the author demonstrates
that it is possible for social science students to obtain an initial
background in such fields with little or no college mathematics as a
prerequisite.



Index of Names 


Adams, Richard N., 104, 142 n., 143
Angell, Robert C., 126 n., 127 n., 133

Coser, Lewis A., 12 n.
Couch, Arthur S., 173 n., 201 n.
Criswell, Joan H., 309 n.
Cronbach, L. J., 158 n., 164 n., 169


Back, Kurt, viii, 2 n.
Backstrom, Charles H., 124
Bales, Robert F., 141, 173, 201 n.
Becker, Howard S., 142
Berelson, Bernard, 25, 51 n., 64 n., 76 n., 133 n., 264 n.
Berger, Joseph, 309 n.
Blalock, Hubert M., Jr., 277, 300
Borgatta, Edgar F., 35 n., 36, 212 n.
Borko, Harold, 152, 194 n., 310 n.
Broom, Leonard, 306
Brown, Roger, 104
Buchler, Ira R., 189 n.
Bush, Robert R., 309 n.

Camilleri, Santo F., 284
Campbell, D. T., 123
Catlin, George E. G., 279 n.
Chapple, E. D., 141 n., 142
Chave, E. J., 201 n.
Chernoff, Herman, 262
Cohen, Bernard P., 309 n.
Cohen, Morris R., 87 n., 88 n., 89 n., 309
Coleman, James S., 309 n., 322
Coombs, C. H., 179 n.


Davies, Vernon, 286
Deri, S., 175 n.
Durkheim, Emile, 129, 279

Edwards, Allen L., 111, 112, 192 n., 194

Feldman, Julian, 310 n., 311
Festinger, L., 179 n., 193 n., 194
Finch, Henry A., 36 n.
Fiske, M., 111 n.
Fruchter, Benjamin, 171, 195

Galanter, Eugene, 205, 206 n., 207, 210, 309 n., 316-18
Gaudet, Hazel, 264 n.
Geer, Blanche, 142
Glueck, Eleanor, 52
Glueck, Sheldon, 52
Goldwater, Barry, 220
Goodenough, W. H., 192 n.
Gottschalk, Louis, 126 n., 127 n., 133
Gouldner, Alvin W., 67
Greenwood, Ernest, 105
Guetzkow, Harold, 147 n., 148, 152

Gulliksen, H., 164 n.
Guthrie, Edwin R., 136
Guttman, Louis A., 187, 189, 192, 193, 194, 312

Hammond, K. R., 123 n.
Hammond, Phillip E., 26
Handy, Rollo, 70, 76 n.
Hayakawa, S. I., 27 n.
Hays, William L., 277
Hempel, Carl G., 35 n., 45, 46 n., 160 n.
Heyns, Roger W., 141
Hinkle, Gisela S., 83
Hinkle, Roscoe C., Jr., 83
Holmberg, Allan R., 104
Homans, George Casper, 86, 315
Huff, Darrell, 230 n., 262
Huntington, E., 180
Hursh, Gerald D., 124
Hyman, Herbert H., 124, 300

Johnson, Lyndon B., 220

Kaplan, Abraham, viii, 47, 54, 57 n., 58 n., 309, 323
Kaplan, Morton A., 149
Katz, D., 179 n.
Kelly, E. L., 194 n.
Kemeny, John G., 309 n.
Kendall, Patricia L., 111, 134
Kerlinger, Fred N., 238
Kilpatrick, F. P., 194
Kish, Leslie, 277
Kluckhohn, Clyde, 126 n., 127 n., 133
Kurtz, Paul, 70, 76 n.


Lambert, William W., 136 n.
Lazarsfeld, Paul, 118 n., 133, 134, 176, 238, 264 n., 301
Lewis, Oscar, 143
Likert, R., 184 n., 194
Lindzey, Gardner, 119 n., 134, 212 n.
Lingoes, J. C., 194 n.
Lippitt, Ronald, 141
Luce, R. Duncan, 309 n., 320

Maccoby, Eleanor E., 119 n.
Maccoby, Nathan, 119 n.
Madge, John, 26
Marschak, Jacob, 322 n., 323
Massarik, Fred, 309 n.
McGinnis, Robert, 195, 246 n., 309 n.
McQuitty, L. L., 213 n.
Meehl, Paul E., 169
Merton, Robert K., 111, I-*-*
Miljovska, D., 222 n.
Mill, John Stuart, I® 91, 171, 226, 229-30, 285, 287, 290, 299
Miller, Delbert C., 169
Miller, George A., 309 n.
Mills, C. Wright, viii, 31, 66, 67,
Montague, Joel, viii
Moore, Frank W., 134
Moreno, J., 212 n.
Morris, Albert, 52
Moses, Lincoln E., 262
Mueller, John H., 279 n.
Murphy, G., 184 n.

Nagel, Ernest, 47, 87 n., 88 n., 89 n., 309
Nelson, Benjamin, 67 n.
Newcomb, T. M., 123 n.

Orcutt, Guy H., 152
Osgood, Charles E., 210-11, 215

Parsons, Talcott, 66-67, 221 n.
Phillips, Bernard S., 8 n., 2U9 n.
Planck, Max, 57
Politz, Alfred, 132
Preiss, Jack J., 104, 142 n., 143




Raiffa, Howard, 320
Ratoosh, Philburn, 309 n.
Ray, William S., 105
Riley, John W., 187 n.
Riley, Matilda White, 86, 187
Roberts, Harry V., 241 n., 262
Rogers, C. R., 111 n.
Rome, Sidney, viii
Rommetveit, Ragnar, 86, 213
Roper, Elmo, 132
Rosenberg, Morris, 238
Roth, Julius A., 142-43

Salmon, Wesley, viii
Shuhaney, Elizabeth, 254 n.
Sellin, Thorsten, 208 n., 216
Selltiz, Claire, 164 n.
Selvin, Hanan, 246 n.
Selznick, Philip O., 306
Senders, Virginia, 177 n., 199 n.
Shils, Edward A., 36 n.
Shubik, Martin, 322 n.
Siegel, Sidney, 298
Simon, Herbert A., 300
Snell, J. Laurie, 309 n.
Snow, C. P., 32 n., 70
Solomon, Herbert, 309 n.
Solomon, R. L., 97 n., 98, 99
Solovay, Sarah A., 279 n.
Stein, Maurice R., 67, 68
Steiner, Gary A., 25, 51 n., 64 n., 76 n., 133 n.
Stephenson, William, 174, 176
Stouffer, Samuel A., 169, 176 n., 187 n., 193 n., 194 n., 301
Suci, George J., 210, 215
Suchman, Edward, viii
Suppes, Patrick, 309 n.

Taneski, V., 222 n.
Tannenbaum, Percy H., 210, 215
Theodore, Chris A., 309 n., 323
Thomas, W. I., 83, 128, 315 n.
Thurstone, L. L., 194, 201 n.
Toby, Jackson, 187 n.
Torgerson, Warren S., 216

Vidich, Arthur, 67 n.

Wallis, W. Allen, 241 n., 262
Wartofsky, Marx, viii
Weber, Max, 36, 221
Wilder, Raymond L., 180 n., 19 , n.
Williams, Robin M., viii, 42
Wirth, Louis, 224
Wolfgang, Marvin E., 208 n., 216






Index of Subjects 


Ability to tell the truth, 112-13, 126- 
128, 136-38 
Abstraction, 27 
ladder of, 27-30 
and language, 305-306
levels of, 85-86
and simulation, 148-50
single level, 31-32
Act meaning, 54, 108, 137
Action meaning, 54, 137
Activity, 211
Addition, 199-201
rule of, 247-48 
Agreement 
analysis, 213 

method of, 87-88, 230-31
Alternative hypothesis, 252-53 
Analysis, see also Statistics 
cause and effect, 219-21 
of covariance, 299-300 
descriptive, 219-21 
multivariate, 279-303 
one variable, 219-21 
qualitative, 68-69, 83-86, 221-226
quantitative, 68-69, 83-86
statistical, 241-78, 286-87, 296-300
three-variable, 284-99
two-variable, 231-38

Association, 273-76; see also Correlation

Awareness, 156-57 
Axiomatic system, 248-52 

for interval scales, 199-200 

for nominal scales, 198-99 

for partial order, 183-84 

for simple order, 179-81, 186-87

Bales category system, 141
Bias, 63

Binomial 

distribution, 250-51 
test, 269-73 

Case studies, 128-29 
Cause and effect 

analyses, 219-20
process of, 279-84
Census tract data, 129-32
and social change, 131-32
Chi-square test, 298
Choice theory, 23-25, 226-31, 310-323
Closure, 44-45, 65, 85-86, 315-16
Clustered probability samples, 267-268
Coding, 221-26
Coefficient of reproducibility, 192-93





Cognitive processes, 310-16 
Computer, 220-21 
as a logic machine, 312 
simulation, 150-51, 310-16
Concepts, see also Variables 
clarity of, 32-34 
and constructs, 36-37 
definitions of, 46-47, 306-307 
discovery of, 171-77 
ladder of abstraction for, 27-30
and measurement, 155 
and observation, 139 
and scale analysis, 194 
selection of, 75-76 
systematic import of, 34-36, 46-47
variables and, 38-39
Concomitant variations, method of, 89, 171-72
Construct, 36-37; see also Concepts
validity, 161-62
Control 

frequency distribution, 92-95 
group, 91 
precision, 92-95 
variables, 287-92 
Correlation, 171-72 
multiple and partial, 299-300 
product moment coefficient of, 274-76
Corroboration, 126
Credibility, 125-26
Criterion validity, 160-61
Cumulation, 77 
Cumulative scales, 185-87 


Data processing, 190-92 
Decision theory, 23-25, 226-31, 310-23

Definitions, see also Concepts 
ambiguity of, 306-307 
Degree of relationship, 40-42 


Demographic data, 129-132 
Dependent variable, 39-40 
Difference, method of, 88-89, 229-230
Discovery
of concepts, 171-77
context of, 56-61, 81
logic of, 309-10
and statistics, 244-46
Disproof, 44, 253 n. 

Documents, 125-34 

external criticism of, 125-26 
historian's approach to, 125-28 
internal criticism of, 125-26 
personal, 128-29 
sociologist’s approach to, 128-29 
statistical, 129-32 
survey archives, 132-33 
of the United States government, 
129 

Ecology, 236 

Edwards personal preference schedule, 9, 11
Equal appearing intervals, method of, 201-205
Equivalence, 198-99
Error, Types I and II, 252-62
Error-choice method, 123
Evaluation, 211

Evidence, rules of, 243 

Existence of relationships, 40-42 

Expectations 

and choice theory, 226-31 
measurement of, 20, 213-15
received, 213-15 
and role, 306-307 
sent, 213-15 
and utility, 317-23 
Expected utility, 318-23 
Expected value deprivation, 13-14, 21-25, 318-23
Experimental group, 91 



Experiments, 65, 87-105 

after-only with one control group, 100-101

before-after with no control groups, 
99-100 

before-after with one control group, 
95-97 

before-after with two control 
groups, 97-99 
design of, 95-104 
factorial design, 102-103 
field, 103-104 

laboratory, 103-104, 108-109
Latin-square design, 102
and observation, 136
participant intervention, 104
randomized-groups design, 101
and simulation, 145-47
and surveys, 107-109, 299
Explanation, 53-56
and prediction, 54-57 
scientific, 62-65 
Explication, 160 

Face validation procedures, 159-60 
Factor analysis, 171-74 
Factor loading, 172 
Field experiment, 103-104 
Fisher exact probability test, 298 
Fixed-alternative questions, 117-18 
Focused interviews, 111 
Free will, 62 
Functionalism, 56


Generalization, 14-15 
Goals, see also Values 
covert and overt, 138 
Guttman scaling, 187-94, 203-205


H-technique, 193-94 
Historical 
studies, 221 
techniques, 125-28 



Human equation, the, 61-65 
Hypotheses, 12-13, 39-40; see also Propositions
alternative, 252-53 
on choice behavior, 226-31 
degree of relationship, 40-42 
descriptive, 39 
direct test of, 156 
existence of relationships, 40-42 
indirect test of, 156-57
null, 252-53
one-variable and two-variable, 219-20
partially direct test of, 156
relational, 39

type of relationship, 40-42 


Ideal type, 36-37
Independent variable, 39-40
Indirect methods, 121-23
error-choice, 123
information test, 123 
Individualism, 202-203 
Information test, 123 
Interaction 

chronograph, 141-42
effects, 98-99
interviewer and respondent, 109-115
observer and observed, 82-83
International relations, 148-50
Interpretation, 248-52
of statistical tests, 298-99
Interval scales, 197-205
Intervening variables, 293-96
Interviews, see also Questionnaires; Surveys
and the context of discovery, 110
and the context of justification, 110
focused, 111
non-directive, 111
and observations, 136-38
and questionnaires, 120-21 




schedule construction, 117-21
semistandardized, 110-11
standardized, 110-11
unstandardized, 110-11
Interviewer
effect, 82

training of, 116-17 
Interviewing 

initial contact, 113-14 
and reason analysis, 118 
recording of data, 114-15
techniques of, 113 
Item analysis, 185 

Justification, context of, 56-57, 69-70

Language, 305-10
Latent structure analysis, 176-77, 182

Likert scales, 184-85, 203-205 
Laws, 42; see also Propositions 
Level of significance, 255-56, 261-62 
Logic 

applications of, 303-23 
of choice, 310-16 
language of, 308-10 
principle of the excluded middle, 
308 


reconstructed, 57-58, 69-70
in use, 57-58 


Magnitude estimation, 206-208 
Mathematics 

applications of, 303-23
and theory, 23-25, 307-308 
and type of relationship, 41 
Measurement 

agreement analysis, 213 
of association, 273-76 
Bales category system, 141 
coding, 221-26
and concepts, 155
and the context of discovery, 157
cumulative scales, 185-87
as data, 156-58
Edwards personal preference schedule, 9, 11
equal appearing intervals, method of, 201-205
error-choice method, 123
of expectations, 20, 213-15
of expected utility, 321-22
factor analysis, 171-74
H-technique, 193-94
information test, 123
interval scales, 197-205
latent structure analysis, 176-77, 182
Likert scales, 184-85
magnitude estimation, 206-208
multiple item, 209-10
multiple scalogram analysis, 194
nominal scales, 177-79
ordinal scales, 179-95
and personal documents,
precision, 165-66
projective techniques, 121-23
and propositions, 155
purpose of, 155
Q-technique, 174-76
ratio scales, 205-207, 317-23
reliability of, 162-65
Rorschach test, 122
scale analysis, 187-94
scale-discrimination technique, 194
and scaling, 166-69
semantic differential, 210-11
single item, 208-10
of social relationships, 9, 12, 215
sociometric tests, 212-13
strategy of, 158, 168-69
of subjective probability, 205-208, 317-23




summated ratings, 182-84 
Szondi test, 175 

thematic apperception test, 9, 11, 
122-23 

and theory, 157-58 
thinking aloud procedure, 310 
of utility, 205-208, 317-23 
validity of, 159-62, 321-23
of values, 9-10, 15-20, 156-57, 173, 226-31

Medical sociology, 5-25, 208, 210-211, 213-15, 226-31, 269-73, 317-23
Medical student study, 5-25, 208, 210-11, 213-15, 317-23
Migration, 221-26 

Mill's experimental methods, 87-90, 
229-31 

Models, 45-46, 58-61; see also Simulation

mathematical, 59 
operating, 58-61 

Multiple item measurement, 209-10 
Multiple scalogram analysis, 194 
Multiplication rule, 247-48 
Multivariate analysis, 279-303 

Nominal scales, 177-79 
Non-directive interviews, 111
Null hypothesis, 252-53 

Objectivity, 30, 81 

Observation, 80-83, 135-45 

and anthropological methods, 135- 
136 

and the experiment, 136 
and the interview, 136-38 
less structured, 139-40, 142 
more structured, 139-42 
naturalistic, 140 
participant, 142-43 
and values, 136-38 
Observed, the, 82-83 


Observer, the, 81-83
Occupational choice, 5-25, 59-61, 208, 210-11, 213-15
One-sided tests, 256
One-variable analysis, 219-21, 231
Open-ended questions, 117-18, 221-226
Operational definitions, 34-35
Operationism, 36-37
extreme, 36-37 
Ordinal scales, 179-95 


Panel study, 300 
Participant observation, 142-43 
Potency, 211 
Precision, 165-66 
and scaling, 166-69 
Prediction, 50-53, 62-65 
and explanation, 54-57 
and prophecy, 53 
Probability, 246-52 

distribution, 254-55, 271-72
frequency interpretation of, 249
sampling, 14-15, 78-80, 263-68
subjective, 205-208
Problem
definition of, 8, 12-14, 73-78
and research design, 77-78
selection of the, 67-68
Projective techniques, 121-23
Rorschach test, 122
sociodrama, 123
Szondi test, 175
thematic apperception test, 9, 11, 122-23
Proof by example, 128-29
Prophecy, 53
Propositions, 37-38; see also Hypotheses; Theory
and construct validity, 161-62
ladder of abstraction for, 28-30
and laws, 42 

and measurement, 155, 166 

selection of, 76-77 
testability of, 38
and theory, 42-44 

Q-technique, 174-76 
Qualitative analysis, 68-69, 83-86, 
221-27 

Quantitative analysis, 68-69, 83-86 
Question order, 120 
Questionnaires, see also Interviews; Surveys

and census tract data, 131 
construction of, 117-21 
Edwards personal preference sched- 
ule, 9, 11 

and interviews, 120-21 
open-ended questions, 221-26 
strategy of design of, 19 

R-technique, 174 
Random 

numbers, 265-66
sampling, 79, 265-67
Randomization, 91-95
of situational factors, 95
Ratio scales, 205-207, 317-23
Reason analysis, 118, 226-31
Reductionism, 45
Reflexiveness, 198-99 
Region of rejection, 255-56 
Reliability, 162-65; see also Meas- 
urement 

and equivalence, 163-64 
and precision, 165-66 
and scaling, 166-69 
and stability, 162-64 
Reproducibility, coefficient of, 192- 
193 

Research design, 77-80 
Residues, method of, 89-90 
Role, 306-307 


Samples 

clustered probability, 268 
non-probability, 268-69
simple random, 265-67 
stratified probability, 267-68 
Sampling, 78-80, 263-68 
simple random, 79 
Scalable, 189-94 
Scale analysis, 187-94, 203-205
Scale-discrimination technique, 194
Scale types, 189, 193-94
Scales
interval, 197-205
nominal, 177-79
ordinal, 179-95
ratio, 205-207, 317-23
Scaling, 166-69 

illustrations of, 18-20 
and precision, 166-69 
and reliability, 166-69 
strategy of, 168-69
Science 

behavioral, 62-65 
goals of, 49-56, 65-68 
and the humanities, 83 
Scientific method, 30, 32, 63-65
Scope, 33-34, 44
Semantic differential, 210-11
Significance, level of, 255-56, 261-262

Simulation, 58-61, 145-52
and closure, 147
computer, 150-51
and the context of discovery, 145
and the context of justification, 145
and controls, 151-52 
and the experiment, 145-47 
of international relations, 148-50 
and level of abstraction, 148-50 
of occupational choice, 58-61 
and sequential relationships, 151- 
152 

and theory, 146, 150-51 





Single item measurement, 208-11
Skepticism, 30, 33, 36, 81-82
Social desirability, 111-12, 156-57
Social relationships, measurement of, 9, 12, 212-13
Sociodrama, 123 
Sociometric tests, 212-13 
Solomon’s two-control-group design, 
97-99 

Split-half method, 164 
Spurious relationships, 284-92 
Statistics, 241-78, 286-87, 296-300 
binomial test, 269-73 
chi-square test, 298 
and the context of discovery, 244- 
246 

correlation, 171-72, 274-76
covariance analysis, 299-300
and decisions of the scientist, 242, 252-57
Fisher exact probability test, 298
functions of, 241-42 
interpretation of, 298-99 
multiple and partial correlation, 
299-300 

one-sided tests, 256 
and probability, 246-52 
and probability sampling, 14-15, 78-80, 263-68
and scientific communication, 242-246
t-test, 286-87
two-sided tests, 256
Type I and II errors, 252-62
Strategy
and the context of justification, 69-70
of definition, 46-47
definition of the problem, 8, 12-14
and degree of structure, 83-86, 139-43
and ethical considerations, 115


of experiments and surveys, 107-109

of extreme operationism, 36-37 
of measurement, 158, 168-69, 

321-23 

and measurement reliability, 164-


of questionnaire construction, 19, 
118-19 

and reconstructed logic, 58 
of sampling, 14-15, 79-80
of scaling, 168-69, 181-82
and the single case, 128-29
of survey analysis, 299-300
Stratified probability samples, 267-268
Structured observational techniques, 139-40

Subjective probability, 205-208, 316- 

Summated ratings, 182-84, 203-205
Surveys, see also Interviews; Questionnaires
administration of, 115-17
archives, 132-33
and controls, 108
and experiments, 107-109
and the individual, 109
limitations of, 299
panel study, 300
Symmetry, 198-99
Synchronization, 141-42
Systematic import, 34-36
Szondi test, 175

t-test, 286-87

Tabular presentation, -33 -Jd 
Tempo, 141
Testability, 38
Thematic apperception test, 9, 11, 122-23

Theorems, 248-52 

for simple order, 180-81 

Theory, 13, 23, 42-46; see also Propositions
and causation, 283-84
of choice, 23-25, 213-15, 226-231, 310-23
and classics, 42
and construct validity, 161-62
and criticism, 42
and definition of the problem, 74-75
and documents, 133
and expectations, 213-15
and factor analysis, 173-74
grand, 31-32
and logic, 310-16
and mathematics, 23-25, 307-308
and measurement, 157-58
and observation, 140
and propositions, 42-44
and Q-technique, 175-76
selection of, 76-77
and simulation, 146, 150-51
and taxonomy, 42
Thinking aloud procedure, 310
Three-variable analysis, 284-99
Thurstone scales, 201-205
Transitivity 

and nominal scales, 198-99 
and simple order, 180-81, 188 
Two-sided tests, 256 
Two-variable analysis, 231-38 
Type I and II error, 252-62 
Type of relationship, 41-42, 54 


Unidimensional, 189
Universe of content, 193
Urbanization and personality, 221-226
Utiles, 206
Utility, 205-208

Validity, 159-62; see also Measurement
construct, 161-62
criterion, 160-61
face, 159-60
Values
and choice theory, 226-31
and ethical neutrality, 62-63
measurement of, 9-10, 15-20, 156-57, 173, 226-31
and observation, 137-38
of the scientist, 63-64, 66-68
and utility, 317-23
Variables, 38-41; see also Concepts
control, 287-92 
dependent, 39-40 
independent, 39-40 
intervening, 293-96 

Verification, 321-22

Verstehen, 54, 56

“Why” questions, 119
Willingness to tell the truth, 112-13, 126-28, 136-38





